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© Non-human Carbonyl hydrolase mutants, DNA sequences and vectors encoding same and hosts transformed with said 
vectors, 

© Novel carbonyl hydrolase mutants derived from the 
amino acid sequence of naturally-occurring or recombinant 
non-human carbonyl hydrolases and DNA sequences encod- 
ing the same. The mutant carbonyl hydrolases, In general, 
are obtained by in vitro modification of a precursor DNA 
sequence encoding the naturally-occurring or recombinant 
carbonyl hydrolase to encode the substitution, insertion or 
deletion of one or more amino acids in the amino acid sequ- 
ence of a precursor carbonyl hydrolase. Such mutants have 
one or more properties which are different than the same 
property of the precursor hydrolase. 
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NON-HUKAN CARBONYL HYDROLASE MUTANTS, 
DNA SEQUENCES AND VECTORS ENCODING SAME 
AND HOSTS TRANSFORMED WITH SAID VECTORS 

The recent development of various in vitro techniques 
to manipulate the DNA sequences encoding 
naturally-oecuring polypeptides as well as recent 
developments in the chemical synthesis of relatively 
short sequences of single and double stranded DNA has 
resulted in the speculation that such techniques can 
be used to modify enzymes to improve some functional 
property in a predictable way. Ulmer, K.M. (1983) 
Science 219 . 666-671. The only working example 
disclosed therein is the substitution of a single 
amino acid within the active site of tyrosyl-tRNA 
synthetase (Cys3 5^Ser) which lead to a reduction in 
enzymatic activity. See Winter, G. , et al. (1982) 
Nature 299 , 756-758; and Wilkinson, A. J*. , et al. 
(1983) Biochemistry 22, 3581-3586 (Cys35-Gly mutation 
also resulted in decreased activity). 

When the same t-RNA synthetase was modified by 
substituting a different amino acid residue within the 
.active site with two different amino acids, one of the 
mutants (ThrSl^Ala) reportedly demonstrated a 
predicted moderate increase in kcat/Km whereas a 
second mutant (ThrSl-Fro) demonstrated a massive 
increase in kcat/Km which could not be explained with 
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certainty. Wilkinson, A.H., et al. (1984) Nature 307 , 
187-188 . 

Another reported example of a single substitution of 
an amino acid residue is the substitution of cysteine 
for isoleucine at the third residue of T4 lysozyme. 
Perry, L* J. , et al. (1984) Science 226 , 555-557. The 
resultant mutant lysozyme was mildly oxidized to form 
a disulfide bond between the new cysteine residue at 
position 3 and the native cysteine at position 97. 
This crosslinked mutant was initially described by the 
author as being enzymatically identical to, but more 
thermally stable than, the wild type enzyme. However, 
in a "Note Added in Proof", the author indicated that 
the enhanced stability observed was probably due to a 
chemical modification of cysteine at residue 54 since 
the mutant lysozyme with a free thiol at Cys54 has a 
thermal stability identical to the wild type lysozyme. 

Similarly, a modified dihydrof olate reductase from 
E.coli has been reported to be modified by similar 
methods to introduce a cysteine which could be 
crosslinked with a naturally-occurring cysteine in the 
reductase. Villafranca, D* E. , et al* (1983) Science 
222 , 782-788. The author indicates that this mutant 
is fully reactive in the reduced state but has 
significantly diminished activity in the oxidized 
state. In addition, two other substitutions of 
specific amino acid residues are reported which 
resulted in mutants which had diminished or no 
activity, 

EPO Publication No. 0130756 discloses the substitution 
of specific residues within B. amyloliquefaciens 
subtilisin with specific amino acids. Thus, Met222 
has been substituted with all 19 other amino acids, 
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Glyl66 with 9 different amino acids and Glyl69 with 
Ala and Ser. 

As set forth below, several laboratories have also 
reported the use of site directed mutagensis to. 
produce the mutation of more than one amino acid 
residue within a polypeptide. 

The amino-terminal region of the signal peptide of the 
prolipoprotein of the E. coli outer membrane was 
stated to be altered by the substitution or deletion 
of residues 2 and 3 to produce a charge change in that 
region of the polypeptide- Inoyye, S., et al. (1982) 
Proc. Nat. Acad. Sci. USA 79 , 3438-3441. The same 
laboratory also reported the substitution and deletion 
of amino acid redisues 9 and 14 to determine the 
effects of such substitution on the hydrophobic region 
of the same signal sequence. Inouye , S., et al. 
(1984) J. Biol. Chem. 259 , 3729-3733. 



Double mutants in the active site of tyrosyl^t-RKA 
synthetase have also been reported. Carter, P.J., et 
al. (1984) Cell 38 , 835-840. In this report, the 
improved affinity of the previously described 
Thr51+Pro mutant for ATP was probed by producing a 
second mutation in the active site of the enzyme. One 
of the double mutants, Gly35/Pro51, reportedly 
demonstrated an unexpected result in that it bound ATP 
in the transition state better than was expected from 
the two single mutants. Moreover, the author warns, 
at least for one double mutant, that it is not readily 
predictable how one substitution alters the effect 
caused by the other substitution and that care must be 
taken in interpreting such substitutions. 
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A mutant is disclosed in U.S. Patent No- 4,53 2,2 07, 
wherein a polyarginine tail was attached to the 
C-terminal residue of 0-urogastrone by modifying the 
DNA sequence encoding the polypeptide. As disclosed, 
the polyarginine tail changed the electrophoretic 
mobility of the urogastrone-polyaginine hybrid 
permiting selective purification. The polyarginine 
was subsequently removed, according to the patentee, 
by a polyarginine specific exopeptidase to produce the 
purified urogastrone. Properly construed, this 
reference discloses hybrid polypeptides which do not 
constitute mutant polypeptides containing the 
substitution, insertion or deletion of one or more 
amino acids of a naturally occurring polypeptide. 

Single and double mutants of rat pancreatic trypsin 
have also been reported* Craik, C.S., et al . (1985) 
Science £28 f 291-297. As reported, glycine residues 
at positions 216 and 226 were replaced with alanine 
residues to produce three trypsin mutants {two single 
mutants and one double mutant) . In the case of the . 
single mutants, the authors stated expectation was to 
observe a differential effect on Km. They instead 
reported a change in specificity (kcat/Km) which was 
primarily the result of a decrease in kcat. In 
contrast, the double mutant reportedly demonstrated a 
differential increase in Km for lysyl and arginyl 
substrates as compared to wild type trypsin but had 
virtually no catalytic activity. 

The references discussed above are provided solely for 
their disclosure prior to the filing date of the 
instant case, and nothing herein is to be construed as 
an admission that the inventors are not entitled to 
antedate such disclosure by virtue of prior invention 
or priority based on earlier filed applications. 
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Based on the above references, however, it is apparent 
that the modification of the amino acid sequence of 
wild type enzymes often results in the decrease or 
destruction of biological activity. 

Accordingly, it is an object herein to provide 
carbonyl hydrolase mutants which have at least one 
property which is different from the same property of 
the carbonyl hydrolase precursor from which the amino 
acid of said mutant is derived. 

It is a further object to provide mutant DNA sequences 
encoding such carbonyl hydrolase mutants as well as 
expression vectors containing such mutant DNA 
sequences. 

Still further, another object of the present invention 
is to provide host cells transformed with such vectors 
as well as host cells which are capable of expressing 
such mutants either intracellularly or 
extraceilularly . 
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gunmar v, of fcfce Jnvention 

The invention includes carbonyl hydrolase mutants, 
preferably having at least one property which is 
substantially different from the same property of the 
precursor non-human carbonyl hydrolase from which the 
5 amino acid sequence of the mutant is derived. These 
properties include oxidative stability, substrate, 
. specificity catalytic activity, thermal stability, 
alkaline stability, pH activity profile and resistance 
to proteolytic degradation. The precursor carbonyl 
10 hydrolase may be naturally occurring carbonyl 
hydrolases or recombinant carbonyl hydrolases. The 
amino acid sequence of the carbonyl hydrolase mutant 
is derived by the substitution, deletion or insertion 
of one or more amino acids of the. precursor carbonyl 
15 hydrolase amino acid sequence. 

The invention also includes mutant DNA sequences 
encoding such carbonyl hydrolase mutants. Further the 
invention includes expression vectors containing such 
20 mutant DNA sequences as well as host cells transformed 
with such vectors which are capable of expressing said 
carbonyl hydrolase mutants. 

TK-ri ^ Description n t the Drawings 

25 Figure 1 shows the nucleotide sequence of the coding 

strand, correlated with the amino acid sequence of B. 
^yinH gnefaciens subtilisin gene. Promoter (p) 
ribosome binding site (rbs) and termination (term) 
regions of the DNA sequence as well as sequences 

30 encoding the presequence (PRE) putative prosequence 

(PRO) and mature form (MAT) of the hydrolase are also 
shown. 



35 



Figure 2 is a schematic diagram showing the substrate 
binding cleft of subtilisin together with substrate. 
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Figure 3 is a stereo view of the S-l binding subsite 
of B. amvloliquefaciens subtilisin showing a lysine 
P-l substrate bound in the site in two different ways. 
Figure 3 A shows Lysine P-l substrate bound to form a 
salt bridge with a Glu at position 156* Figure 3B 
shows Lysine P-l substrate bound to form a salt bridge 
with Glu at position 166. 

Figure 4 is a schematic diagram of the active site of 
subtilisin Asp32, His64 and Ser221. 

Figures 5A and 5B depict the amino acid sequence of 
subtilisin obtained from various sources. The 
residues directly beneath each residue of B. 
amvloliquefaciens subtilisin are equivalent residues 
which (1) can be mutated in a similar manner to that 
described for B. amvloliquefaciens subtilisin, or (2) 
can be used as a replacement amino acid residue in B. 
amyloliquef aciens subtilisin. Figure 5C depicts 
conserved residues of B. amyloliquef aciens subtilisin 
when compared to other subtilisin sequences. 

Figures €A and 6B depict the inactivation of the 
mutants Met222L and Met222Q when exposed to various 
organic oxidants. 

Figure 7 depicts the ultraviolet spectrum of Ket222F 
subtilisin and the difference spectrum generated after 
inactivation by diperdodecanoic acid (DPDA) . 

Figure B shows the pattern of cyanogen bromide digests 
of untreated and DPDA oxidized subtilisin Met222F on 
high resolution SDS-pyridine peptide gels. 
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Figure 9 depicts a map of the cyanogen bromide 
fragments of Fig. 8 and their alignment with the 
sequence of subtilisin Met222F. 

Figure 10 depicts the construction of mutations 
between codons 45 and 50 of g. amvlolioruef acians 
subtilisin. 

Figure 11 depicts the construction, of mutations 
between codons 122 and 127 of fi. amvlol iouef aciens 
subtilisin. 

Figure 12 depicts the effect of DPDA on the activity 
of subtilisin mutants at positions 50 and 124 in 
subtilisin Met222F. 

Figure 13 depicts the construction of mutations at 
codon 166 of £. amyloliquef aciens subtilisin. 

Figure 14 depicts the effect of hydrophobicity of the 
P-l substrate side-chain on the kinetic parameters of 
wild-type fi. am yloliqu ef aciens subtilisin. 

Figure 15 depicts the effect of position .166 
side-chain substitutions on P-l substrate specificity. 
Figure ISA shows position 166 mutant subtilisins 
containing non-branched alkyl and aromatic side-chain 
substitutions arranged in order of increasing 
molecular volume. Figure 15B shows a series of mutant 
enzymes progressing through p- and T -branched 
aliphatic side chain substitutions of increasing 
molecular volume. 

Figure 16 depicts the effect of position 166 
side-chain volumn on log kcat/Km for various P-l 
substrates. 
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Figure 17 shows the substrate specificity differences 
between Ilel66 and wild- type (Glyl66) B- 
amvloliouefaciens subtilisin against a series of 
alpha tic and aromatic substrates. Each bar represents 
the . difference in log kcat/Km for . Ilel66 minus 
wild-type (Glyl66) subtilisin. 

Figure IS .depicts the construction of mutations at 
codon 169 of B. amyloliouef aciens subtilisin. 

Figure 19 depicts the construction of mutations at 
codon 104 of B. amvloliouefaciens subtilisin. 

I 

Figure 2 0 depicts the construction of mutations at 
codon 152 J3. amy 161 iquef ac iens subtilisin. 

Figure 21 depicts the construction of single mutations 
at codon 156 and double mutations at codons 156 and 
166 of B„ amvlol iouef aciens subtilisin. 

Figure 2 2 depicts the construction of mutations at 
codon 217 for B. amylol iouef aciens subtilisin. 

Figure 2 3 depicts the kcat/Km versus pH profile for 
mutations at codon 156 and 166 in B. amvloliouefaciens 
subtilisin. 

Figure 23A depicts the kcat/Km versus pH profile for 
mutations at codon 156 and 166 in B. amyloliquef aciens 
subtilisin. 

Figure 24 depicts the kcat/Km versus- pH profile for 
mutations at codon 222 in B* amvloliquaf aciens 
subtilisin. 
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Figure 34 depicts the strategy for constructing 
plasinids containing random cassette mutagenesis over 
residues 197 through 22 B. 

Figure 35 depicts the oligodeoxynucleotides used for 
random cassette mutagenesis over residues 197 through 
228. 

Figure 36 depicts the construction of mutants at codon 
204. 

Figure 37 depicts the oligodeoxynucleotides used for 
synthesizing mutants at codon 204. 

Detailed Description 

The inventors have discovered that various single and 
multiple in vitro mutations involving the 
substitution, deletion or insertion of one or more 
amino acids within a non-human carbonyl hydrolase 
amino acid secjuence can confer advantageous properties 
to such mutants when compared to the non-mutated 
carbonyl hydrolase . 

Specifically, B. amvloliouef aciens subtilisin, an 
alkaline bacterial protease, has been mutated by 
modifying the DNA encoding the subtilisin to encode 
the substitution of one or more amino acids at various 
amino acid residues within the mature form of the 
subtilisin molecule. These in vitro mutant 
subtil isins have at least one property which is 
different when compared to the same property of the 
precursor subtilisin. These modified properties fall 
into several categories including: oxidative 
stability, substrate specificity, thermal stability, 
alkaline stability, catalytic activity, pH activity 
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profile, resistance to proteolytic degradation, Kb, 
kcat and Km/kcat ratio. 

Carbonyl hydrolases are enzymes which hydrolyze 

0 

5 I 

compounds containing C-X bonds in which X is oxygen or 
nitrogen. They include naturally-occurring carbonyl 
hydrolases and recombinant carbonyl hydrolases. 
Naturally occurring carbonyl hydrolases principally 
10 include hydrolases, e.g. lipases and peptide 
hydrolases, e.g. subtil is ins or metalloproteases. 
Peptide hydrolases include a -aminoacy lpeptide 
hydrolase, peptidylamino-acid hydrolase, acylamino 
hydrolase, serine carboxypeptidase, metallocarboxy- 
15 peptidase, thiol proteinase, carboxylproteinase and 
metalloproteinase. Serine, metallo, thiol and .acid 
proteases are included, as well as endo and exo- 
proteases . 

20 "Recombinant carbonyl hydrolase" refers to a carbonyl 
hydrolase in which the DNA sequence encoding the 
naturally occurring carbonyl hydrolase is modified to 
produce a mutant DNA sequence which encodes the 
substitution, insertion or deletion of one or more 

25 amino acids in the carbonyl hydrolase amino acid 
sequence. Suitable modification methods are disclosed 
herein and in EPO Publication No. 013 0756 published 
January 9, 1985. 

30 • Subtilisins are bacterial carbonyl hydrolases which 
generally act to cleave peptide bonds of proteins or 
peptides. As used herein, "subtilisin" means a 
naturally occurring subtilisin or a recombinant 
subtilisin. A series of naturally occurring 

35 subtilisins is known to be produced and often secreted 
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by various bacterial species. Amino acid sequences of 
the members of this series are not entirely 
homologous. However, the subtilisins in this series 
exhibit the same or similar type of proteolytic 
activity. This class of serine proteases shares a 
common amino acid sequence defining a catalytic triad 
which distinguishes them from the chymotrypsin related 
class of serine proteases. The subtilisins and 
chymotrypsin related serine proteases both have a 
catalytic triad comprising aspartate, histidine and 
serine. In the subtilisin related proteases the 
relative order of these amino acids, reading from the 
amino to carboxy terminus is aspartate-histidine- 
serine. In the chymotrypsin related proteases tne 
relative order, however is histidine-aspartate-serine. 
Thus, subtilisin herein refers to a serine protease 
having the catalytic triad of subtilisin related 
proteases, 

"Recombinant subtilisin" refers to a subtilisin in 
which the DNA sequence encoding the subtilisin is 
modified to produce a mutant DNA sequence which 
encodes the substitution, deletion or insertion of one 
or more amino acids in the naturally occurring 
subtilisin amino acid sequence. Suitable methods to 
produce such modification include those disclosed 
herein and in EPO publication No. 0130756, For 
example, the subtilisin multiple mutant herein 
containing the substitution of methionine at amino 
acid residues 50, 124 and 22 2 with phenylalanine, 
isoleucine and glutamine, respectively, can be 
considered to be derived from the recombinant 
subtilisin containing the substitution of glutamine at 
residue 222 (Q222) disclosed in EPO Publication No, 
0130756. The multiple mutant thus is produced by the 
substitution of phenylalanine for methionine at 
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residue 50 and ieoleucine for Methionine et residue 
124 in the Q222 recombinant subtilisin. 

"Carbonyl hydrolases" and their gene. may be obtained 
from »ny proc.ryotic and euoaryotic 
suitable exiles of procaryotio organises include 
gram negative organisms such a. £. SSli « ps.udomonas 
and gram positive bacteria such as micrococcus or 
bacillus. Examples of euoaryotic organisms from which 
carbonyl hydrolase and their genes may be obtained 
Include yeast such as S. seasimiae. fungr such as 
Aspergillus sp., and non-human mar-alien source, such 
.. for example. Bovine sp. from which the gene 
encoding the carbonyl hydrolase chymosin can be 
obtained. As with subtili.ins, a series of carbonyl 
hydrolase, can be obtained from various related 
specie, which have amino acid ..guences which are not 
entirely homologous between the member, of thst series 
but which nevertheless exhibit the same or similar 
typs of biological activity. Thus, non-human carbonyl 
type oi » has . functional definition 

hydrolase as used herein has a luncci 
which refers to carbonyl hydrolase, which are 
associate, dir.ctly or indirectly, with prccaryctic 
and non-human eucaryotic .ourees. 

A "carbonyl hydrolase mutant" has an amino acid 
segu.nce which is derived from the amino acid sequence 
ef a non-human "precursor carbonyl hydrolase". The 
precursor carbonyl hydrolases include naturally- 
occurring carbonyl hydrol nd recombinant carbonyl 

•hydrolases. The amino acid sequence of the carbonyl 
hydrolase mutant is "derived" from the precursor 
hydrolase amino acid sequence by the ^u-on 
deletion or insertion of one or more ^ttLo^ol 
the precursor amino acid sequence, such modification 
Is of the "precursor DNA sequence" which encodes the 
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amino acid sequence of the precursor carbonyl 
hydrolase rathern than manipulation of the precursor 
carbonyl hydrolase per se. Suitable methods for such 
manipulation of the precursor DNA sequence include 
methods disclosed herein and in EPO Publication No. 
0130756. 

Specific residues of B . amylol iguef aciens subtilisin 
are identified for substitution, insertion or 
deletion. These amino acid position numbers refer to 
those assigned to the B. amvloliouef aciens subtilisin 
sequence presented in Fig. 1. The invention, however, 
is not limited to the mutation of this particular 
subtilisin but extends to precursor carbonyl 
hydrolases containing amino acid residues which are 
"equivalent" to the particular identified residues in 
B. amvloliouef aciens subtilisin* 

A residue (amino acid) of a precursor carbonyl 
hydrolase is equivalent to a residue of B. 
amyloliquef aciens subtilisin if it is either 
homologous (i.e., corresponding in position in either 
primary or , tertiary structure) or analagous to a 
specific residue or portion of that residue in B. 
amyloliouef aciens subtilisin (i.e., having the same or 
similar functional capacity to combine, react, or 
interact chemically) . 

In order to establish homology to primary structure, 
the amino acid sequence of a precursor carbonyl 
hydrolase is directly comparted to the B. 
amyloliquef aciens subtilisin primary sequence and 
particularly to a set of residues known to be 
invariant in all subtilisins for which sequence is 
known (Figure 5C) . After aligning the conserved 
residues, allowing for necessary insertions and 
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deletions in order to maintain alignment (i.e., 
avoiding the elimination of conserved residues through 
arbitrary deletion and insertion) , the residues 
equivalent to particular amino acids in the primary 
sequence of £. amvloliquef aciens subtilisin are 
defined. Alignment of conserved residues preferably 
should conserve 100% of such residues. However , 
alignment of greater than 75% or as little as 50% of 
conserved residues is also adequate to define 
equivalent residues. Conservation of the catalytic 
triad, Asp32/His64/Ser22i should be maintained. 

For example, in Figure 5A the amino acid sequence of 
subtilisin from B. amyloiiguef aciens £. subtilisin 
var. 1168 and £. ' lichenformis (carlsbergensis) are 
aligned to provide the maximum amount of homology 
between amino acid sequences. A comparison of these 
sequences shows that there are a number of conserved 
residues contained in each sequence. These residues 
are identified in Fig. 5C. 

These conserved residues thus may be used to define 
the corresponding equivalent amino acid residues of 
amyloiiguef aciens subtilisin in other carbonyl 
hydrolases such a© thermitase derived from 
Thermoactinomyces. These two particular sequences are 
aligned in Fig. 5B to produce the maximum homology of 
conserved residues. As can be seen there are a number 
of insertions and deletions in the thermitase sequence 
as compared to J&. amvloliouef aciens subtilisin. Thus, 
in thermitase the equivalent amino acid of Tyr217 in 
£. amYloliquefaciens subtilisin is the particular 
lysine shown beneath Tyr217. 



In Fig, 5A, the equivalent amino acid at position 217 
in fi. amyloiiguef aciens subtilisin is Tyr. Likewise, 
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in B. subtilis subtilisin position 217 is also 
occupied by Tyr but in £. lichenif ormts position 217 
is occupied by Leu, 

Thus, these particular residues in thermitase, and 
subtilisin from B. subtilisin and B. lichenif ormis may 
be substituted by a different amino acid to produce a 
mutant carbonyl hydrolase since they are equivalent in 
primary structure to Tyr2 17 in B. amyloliquef aciens 
subtilisin. Equivalent amino acids of course are not 
limited to those for Tyr217 but extend to any residue 
which is equivalent to a residue in B. f amvlolique - 
faciens whether such residues are conserved or not. 

Equivalent residues homologous at the level of 
tertiary structure for a precursor carbonyl hydrolase 
whose tertiary structure has been determined by x-ray 
crystallography, are defined as those for which the 
atomic coordinates of 2 or more of the main chain 
atoms of a particular amino acid residue of the 
precursor carbonyl hydrolase and B. amyloliquef aciens 
subtilisin (N on N, CA on CA, C on C, and 0 on 0) are 
within 0.13nm and preferably O.lnm after alignment. 
Alignment is achieved after the best model has been 
oriented and positioned to give the maximum overlap of 
atomic coordinates of non-hydrogen protein atoms of 
the carbonyl hydrolase in question to the £. 
amvlol icruef aciens subtilisin. The best model is the 
crystallographic model giving the lowest R factor for 
experimental diffraction data at the highest 
resolution available. 

2|Fo(h)|-|Fc(h)1 
R factor = ^ 2 |Fo(h) | 
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Equivalent residues which are functionally analogous 
to a specific residue of B_. rnnylol jquef aciens 
subtilisin are defined as those amino acids of the 
precursor carbonyl hydrolases which may adopt a 
conformation such that they either alter, modify or 
contribute to protein structure, substrate binding or 
catalysis in a manner defined and attributed to a 
specific residue of the B, amyloliguef aciens 
subtilisin as described herein. Further, they are 
those residues of the precursor carbonyl hydrolase 
(for which a tertiary structure has been obtained by 
x-ray crystallography) , which occupy an analogous 
position to the extent that although the main chain 
atoms of the given residue may not satisfy the 
criteria of equivalence on the basis of occupying a 
homologous position, the atomic coordinates of at 
least two of the side chain atoms of the residue lie 
with 0.13nm of the corresponding side chain atoms of 
2, amy lol jquef aciens subtilisin. The three 
dimensional structures would be aligned as outlined 
above. 

Some of the -residues identified for substitution, 
insertion or deletion are conserved residues whereas 
others are not. In the case of residues which are not 
conserved, the replacement of one or more amino acids 
is limited to substitutions which produce a mutant 
which has an amino acid sequence that does not 
correspond to one found in nature. In the case of 
conserved residues, such replacements should not 
result in a naturally occurring sequence. The carbonyl 
hydrolase mutants of the present invention include the 
mature forms of carbonyl hydrolase mutants as well as 
the pro- and prepro- forms of such hydrolase mutants. 
The prepro-forms are the preferred construction since 
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this facilitates the expression, secretion and 
maturation of the carbonyl hydrolase mutants, 

"Expression vector" refers to a DNA construct 
containing a DNA sequence which is operably linked to 
a suitable control sequence capable of effecting the 
expression of said DNA in a suitable host. Such 
control sequences include a promoter to effect 
transcription", an optional operator sequence to 
control such transcription, a sequence encoding 
suitable mRNA ribosome binding sites, and sequences 
which control termination of transcription and 
translation. The vector may be a plasmid, a phage 
particle, or/simply a potential genomic insert. Once 
transformed into a suitable host, the vector may 
replicate and function independently of the host 
genome, or may, in some instances, integrate into the 
genome itself. In the present specification, 
"plasmid" and "vector" are sometimes used 
interchangeably as the plasmid is the most commonly 
used form of vector at present. However, the 
invention is intended to include such other forms of 
expression vectors which serve equivalent functions 
and which are, 'or become, known in the art. 

The "host cells" used in the present invention 
generally are procaryotic or eucaryotic hosts which 
preferably have been manipulated by the methods 
disclosed in EPO Publication No. 0130756 to render 
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them incapable of secreting enzymatically active 
endoprotease. A preferred host cell for expressing 
subtilisin is the Bacillus strain BG203 6 which is 
deficient in enzymatically active neutral protease and 
alkaline protease (subtilisin) . The construction of 

strain BG2 036 is described in detail in EPO Publicatin 
No. 0130756 and further described by Yang, M.Y., et 

al. (1984) 3 - lacterlsL: jL6£, 15-21. Other host cells 

for expressing subtilisin include Bacillus subtil is 

1168 (EPO Publication No. 0130756} . 

Host cells are transformed or transfected with vectors 
constructed using recombinant DNA techniques. Such 
transformed host cells are capable of either 
replicating vectors encoding the carbonyl hydrolase 
mutants or expressing the desired carbonyl hydrolase 
mutant. In the case of vectors which encode the pre 
or prepro form of the carbonyl hydrolase mutant, such 
mutants, when expressed, are typically secreted from 
the host cell into the host cell medium. 

"Operably linked" when describing the relationship 
between two DNA. regions simply means that they are 
functionally related to each other. For example, a 
preseguence is operably linked to a peptide if it 
functions as a signal sequence, participating in the 
secretion of the mature form of the protein most 
probably involving cleavage of the signal sequence. A 
promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence? a ribosome 
binding site is operably linked to a coding sequence 
if it is positioned so as to permit translation. 



The genes encoding the naturally-occurring precursor 
carbonyl hydrolase may be obtained in accord with the • 
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general methods described herein in EPO Publication 
No. 0130756. 

Once the carbonyl hydrolase gene has been cloned, a 
number of modifications are undertaken to enhance the 
use of the gene beyond synthesis of the naturally- 
occurring precursor carbonyl hydrolase. Such 
modifications include the production of recombinant 
carbonyl hydrolases as disclosed in EPO Publication 
No, 0130756 and the production of carbonyl hydrolase 
mutants described herein. 

The carbonyl hydrolase mutants of the present 
invention may be generated by site specific 
mutagenesis (Smith, M. (1985) Ann, Rev. Genet. 423 ; 
Zoeller, M.J., et al. (1982) Nucleic Acid Res. 10 , 
6487-6500) , cassette mutagenesis (EPO Publication No. 
0130756) or random mutagenesis (Shortle, D., et al. 
(1985) Genetics , 110 , 539; Shortle, D., et al. (1986) 
Proteins: Structure, Function and Genetics , 1, 81; 
Shortle, D. (1986) J. Cell. Biochem , 30 , 281; Alber, 
T. , et al. (1985) Proc. Natl. Acad, of Sci. , 82 , 747? 
Matsumura, M. , et al. (1985) J. Biochem. , 260 , 15298; 
Liao, H. f et al. (1986) Proc. Natl. Acad, of Sci. , 83 
576) of the cloned precursor carbonyl hydrolase. 
Cassette mutagenesis ind the random mutagenesis method 
disclosed herein are preferred. 

The mutant carbonyl hydrolases expressed upon 
transformation of suitable hosts are screened for 
enzymes exhibiting one or more properties which are 
substantially different from the properties of the 
precursor carbonyl hydrolases, e.g., changes in 
substrate specificity, oxidative stability, thermal 
stability, alkaline stability, resistance to 



10 



15 



20 



25 



30 



0251446 

-22- 

proteolytic degradation, pH-activity profiles and the 
like. 

A change in substrate specificity is defined as. a 
difference between the kcat/Km ratio of the precursor 
carbonyl hydrolase and that of the hydrolase mutant. 
The kcat/Km ratio is a measure of catalytic 
efficienty. Carbonyl hydrolase mutants with increased 
or diminished kcat/Km ratios are described in the 
examples. Generally, the objective will be to secure 
a mutant having a greater (numerically large) kcat/Km 
ratio for a given substrate, thereby enabling the use 
of the enzyme to more efficiently act on a target 
substrate. A substantial change in kcat/Km ratio is 
preferably at least 2 -fold increase or decrease. 
However, smaller increases or decreases in the ratio 
(e.g., at least 1.5-fold) are also considered 
substantial. An increase in kcat/Km ratio for one 
substrate may be accompanied by a reduction in kcat/Km 
ratio for another substrate. This is a shift in 
substrate specificity, and mutants exhibiting such 
shifts have utility where the precursor hydrolase is 
undesirable, .e.g. to prevent undesired hydrolysis of a 
particular substrate in an admixture of substrates. 
Km and kcat are measured in accord with known 
procedures, as described in EPO Publication No. 
0130756 or as described herein. 

Oxidative stability is measured either by known 
procedures or by the methods described hereinafter. A 
substantial change in oxidative stability is evidenced 
by. at least about 50% increase or decrease (preferably 
decrease) in the rate of loss of enzyme activity when 
exposed to various oxidizing conditions. Such 
oxidizing conditions are exposure to the organic 
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oxidant diperdodecanoic acid (DPDA) under the 
conditions described in the examples. 

Alkaline stability is measured either by known 
procedures or by the methods described herein. A 
substantial change in alkaline stability is evidenced 
by at least about a 5% or greater increase or decrease 
(preferably increase) in the half life of the 
enzymatic activity of a mutant when compared to the 
precursor carbonyl hydrolase. In the case of 
subtilisins, alkaline stability was measured as a 
function of autoproteolytic degradation of subtilisin 
at alkaline pH, e.g. for example, 0.1M sodium 
phosphate, pH 12 at 25* or 30* C. 

Thermal stability is measured either by known 
procedures or by the methods described herein. A 
substantial change in thermal stability is evidenced 
by at least about a 5* or greater increase or decrease 
(preferably increase) in the half-life of the 
catalytic activity of a mutant when exposed to a 
relatively high ..temperature and neutral pH as compared 
to the precursor carbonyl hydrolase. In the case of 
subtilisins, thermal stability is measured by the 
autoproteolytic degradation of subtilisin at elevated 
temperatures and neutral pH, e.g., for example 2mM 
calcium chloride, 50mM MOPS pH 7.0 at 59'C, 

The inventors have produced mutant subtilisins 
containing the substitution of the amino acid residues 
of B. amyloliquef aciens subtilisin shown in Table I. 
The wild type amino acid sequence and DNA sequence of 
B. amvloliguef aciens subtilisin is shown in Fig. 1. 
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Residue 

Tyr2l 

Thr22 

Ser24 

Asp32 

Ser33 

Asp36 

Gly46 

Ala48 

Ser49 

Met50 

Asn77 

Ser87 

Lys9 4 

Val95 

Leu96 

Tyrl04 

Ilel07 

GlyllO 

Metl24 

Asnl55 

Glul56 

Glyl66 

Glyl69 

Lysl70 

Tyrl71 

Prol72 

Phel89 

Aspl97 

Metl99 

Ser204 

Lys213 

Tyr217 

Ser221 
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Replacement Amino Acid 



F A 

c 

c 

Q S 
A T 
A G 
V 

E V 
C L 
C F 
D 
C 
C 

c 

D 

A C 
V 

C R 
I L 
A D 
Q S 

C E 
C D 
E R 
F 

E Q 
A C 
R A 
I 

C R 
R T 
A C 
A C 



R 

V 



DEFGHIKLMNPQRSTVW 



H Q T 

ILMPSTWY 

EFHIKLMNPQRTVWY 



DEGHIKLMNPQRSTVWY 



L P 



DEFGHIKLMNPQRSTVW 
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The different amino acids substituted are represented 
in Table I by the following single letter 
designations: 



Amino acid 

or residue 3 -letter 1-letter 

thereof symbol symbol 

Alanine Ala A 

Glutamate Glu E 

Glutamine Gin Q 

Aspartate Asp D 

Asparagine Asn N 

Leucine Leu L 

Glycine Gly g 

Lysine Lys K 

Serine Ser s 

Valine Val V 

Ar gin ine Ar g R 

Threonine Thr T 

Proline Pro P 

Isoleucine lie I 

Methionine Met M 

Phenylalanine Phe F 

Tyrosine Tyr Y 

Cysteine Cys c 

Tryptophan Trp W 

Histidine His H 



Except where otherwise indicated by context, wild-type 
amino acids are represented by the above three-letter 
symbols and replaced amino acids by the above 
single-letter symbols. Thus, if the methionine at 
residue 50 in g. amyloliquef aciens subtilisin is 
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replaced by phenylalanine, this mutation (mutant) may 
be designated MetSOF or F50* Similar designations 
are used for multiple mutants. 

In addition to the amino acids used to replace the 
residues disclosed in Table X, other replacements of 
amino acids at these residues are expected to produce 
mutant subtilisins having useful properties. These 
residues and replacement amino acids are shown in 
Table II. 
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Residue 


Replacement Amino . 


Tyr-21 


L 


Thr22 


K 


Ser2 4 


A 


Asp32 




Ser33 


G 


GXy46 




Ala4 8 




Ser49 




MetSO 


L K I V 


Asn77 


D 


Ser87 


N 


Lys94 


R Q 


Val95 ~ 


L I 


Tyrl04 




Metl24 


K A 


Alal52 


C L I T M 


Asnl55 




Glul56 


A T M L Y 


Glyl66 




Glyl69 




Tyrl71 - 


K R E Q 


Prol72 


D N 


Phel89 




Tyr217 




Ser22l 




Met222 





Each of the mutant subtilisins in Table I contain the 
replacement of a single residue of the £. 
ainvloliauef aciens amino acid sequence. These 
particular residues were chosen to probe the influence 
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of such substitutions on various properties of B ■ 
amvloliguefacien subtilisin. 

Thus, the inventors have identified Met 12 4 and Het2 22 
as important residues which if substituted with 
another amino acid produce a mutant subtilisin with 
enhanced oxidative stability. For Met 124, Leu and lie 
are preferred replacement amino acids. Preferred 
amino acids for replacement of Met222 are disclosed in 
EPO Publication No* 0130756, 

Various other specific residues have also been 
identified as being important with regard to substrate 
specificity. These residues include Tyrl04, Alal52, 
Glul56, Glyl66, Glyl69, Phel89 and Tyr217 for which 
mutants containing the various replacement amino acids 
presented in Table I have already been made, as well 
as other residues presented below for which mutants 
have yet to be made. 

The identification of these residues, including those 
yet to be mutated, is based on the inventors 1 high 
resolution crystal structure of £. amvloliquef aciens 
subtilisin to 1.8 A (see Table III), their experience 
with in vitro mutagenesis of subtilisin and the 
literature on subtilisin. This work and the x-ray 
crystal structures of subtilisin containing covalently 
bound peptide inhibitors (Robertus, J.D., et al. 
(1972) Biochemistry 11. 2439-2449), product complexes 
(Robertus, J.D., et al. (1972) Biochemistry 11, 
4293-4303), and transition state analogs (Matthews, 
D.A., fit al (1975) J. Biol. Chem. 250, 7120-7126; 
Poulos, T.L., et al. (1976) ,7. aiol. Chem. 251, 
1097-1103), has helped in identifying an extended 
peptide binding cleft in subtilisin. This substrate 
binding cleft together with substrate is schematically 
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diagramemed in Fig. 2, according to the nomenclature 
of Schechter, X., |t al. (1967) Biochem Bio. Res. 
Commun. £7, 157* The scissile bond in the substrate 
is identified by an arrow. The P and P« designations 
refer to the amino acids which are positioned 
5 respectively toward the amino or carboxy terminus 

relative to the scissle bond. The S and S» 
designations refer to subsites in the substrate 
binding cleft of subtilisin which interact with the 
corresponding substrate amino acid residues. 
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Atomic Coordinates for the 
Apoenzyme Form of S# Amvlolicruef aciens 
Subtilisin to l.SABesolution 



1 


itt 


II 


11.45* 


13.195 


-21 .75* 


1 


At* 


c 


11*731 


11.955 


-21.3** 


1 


ALA 


ci 


21.019 


11*511 


-21.193 


z 


4L* 


CA 


17*219 


49*001 


-21.4 54 


z 


4L« 


0 


11* 74) 


47* 1*5 


•21 .491 


2 


ELM 


C4 


15*121 


47. IBS 


"-21.927 


2 


frLH 


Otl 


13*023 


41.43 2 


-22 . 9*7 


1 


lt» 


M 


1 T.4T7 


47.205 


—1 9 .152 


3 


111 


c 


1**711 


44*91* 


—19*490 


3 


SEt 


CI 


11.111 


45.131 


-11.0*9 


4 


9AL 


« 


14* 991 


43*44* 


-19*725 


4 


tAL 


c 


14*121 


41.954 


— 1 1.290 


4 


7AL 


Cl 


11* 001 


41**22 


-20.022 


4 


V AL 


C42 


14*137 


42.24* 


-22.19* 


5 


*tD 


CA 


19.914 


41.415 


-1*. 027 


5 


MO 


0 


14. IIS 


19.243 


-1 7* 144 


5 


MD 


cc 


13-141 


43.215 


—1 1*921 


4 


I7A 


ii 


14.141 


39*24 0 


-15.417 


4 


7Tt 


G 


15-159 


3*. 975 


-1 5. 521 


4 


771 


CI 


17*124 


17*323 


•14.134 


4 


771 


CD1 


11.417 


35*452 


-14.14* 


4 


7TI 


tei 


11*135 


34.170 


-14*453 


4 


771 


C2 


11*222 


33*154 


-11*421 


7 


CL1 


n 


14. +44 


17.342 


-1***30 


7 


6LT 


c 


12*410 


14.535 


-15*470 


• 


7AL 


m 


12.441 


17*529 


-14 .541 


1 


VAL 


c 


12*143 


34.433 


-1 1.73S 


• 


9AL 


Cl 


11.741 


31*100 


•11*547 


• 


7 At 


C42 


11.191 


19.119 


•17*733 


1 


SEt 


CA 


14.419 


15*342 


-19. 142 


9 


111 


0 


14*112 


13*114 


•11.101 


9 


SEA 


oc 


14*142 


1**747 


•20.351 


ii 


6LH 


CA 


11*9*4 


12**34 


•14.17* 


it 


4L« 


6 


12.715 


10*442 


-17*413 


u 


CL* 


CC 


14*295 


11*417 


•14.511 


14 


Cm 


CE1 


14*514 


13.1*1 


—12.744 


11 


tLE 


if 


1 1*421 


12.575 


•1 7*471 


It 


ILI 


C 


10*209 


11. 792 


— * 9 - 4.A t 


11 


ILE 


Cl 


1* 132 


12 . 4 & 9 


A I alf 3 


11 


til 


CC2 


1.1*2 


32**55 


_% ft *j j a 


12 


L7S 


n 


11 . 57> 




—20.277 


12 


LTS 


c 


10.45* 


13.11* 


•22*522 


12 


t7S 


Cl 


11*257 


10.44* 


-22.21* 


12 


LTS 


CO 


12.143 


21.117 


-22.159 


12 


L7S 


mi 


14*47* 


27.411 


-20.934 


11 


ALA 


CA 


9. 12 5 


11.191 


-22. 411 


11 


ALA 


0 


1.331 


15*104 


-24.901 


14 


910 


If 


11.332 


15*951 


-23.193 


14 


no 


C 


11.71* 


15.157 


-24*317 


14 


»to 


Cl 


11.442 


14.111 


•24**92 


14 


FID 




12.211 


15.934 


•22*751 


11 


ALA 


CA 


11.179 


13.451 


-27.9*1 


11 


ALA 


0 


14*111 


11.711 


-21.271 


14 


LfV 


H 


t.lll 


14.139 


•27.249 


14 


LIU 


C 


7. 112 


15.125 


-2A.521 


14 


LtU 


Cl 


4.74* 


14.423 


•24.491 


14 


LIU 


en 


1.111 


11.234 


-27. #09 


17 
17 


mi 


m 


1.445 


14.121 


•27.922 


*rs 


c 


1.111 


17.911 


-29.191 


17 


ms 


Cl 


1.711 


19.111 


•27**52 


17 


NCI 


mi 


1.131 


19.117 


•21.272 


17 


«tll 


CEl 


1.224 


21.914 


•24.144 


14 


. Ill 


a 


11*4*3 


27.133 


-11.121 



2 ALA CA 19.111 11.774 -21. 1*5 

1 *L* 0 11.174 11.197 -21.175 

2 4L« • 10*2*1 49.11* -22.14} 
2 CL* C 17.175 47*70* -21.992 
2 4LK C9 24*125 41.740 -22*449 
2 SL* CO 11.112 47.742 -22*930 

2 CL« *E2 14.11* 44.917 -23*92* 

3 SCI CA 17.150 45.1*1 -19.437 
1 SEt O 15*140 45.352 -19*229 
1 SEl 04 17.*»2 44.210 -17*449 

4 ¥AL CA 15*94* 42.419 -19.439 

• »AL fi 17.123 41.17* -11.11* 

4 ¥41 CCI 14.174 41,172 -21*741 

i ; -it-mi 

1 *WQ C 11.101 39.905 -1A.249 

I MO Cl 14.150 41.110 -15.2*3 

5 WO CO 14.144 42.91* -17.417 
4 7ft CA 14.421 37.103 -15*711 

• Tft 0 15.224 21.943 -14.235 
4 771 CC 11.021 35.147 -15.155 
4 7Ti C02 17*49* 34*919 -14*171 
4 771 C62 17.1J5 13*539 -14.379 
4 77* OH 11.312 31.13* -15.994 
7 6L7 CA 13.211 14*44* -14.374 
T 4L7 O 11.747 35.471 -15.113 
I 7AL CA 11.777 27.523 -17.134 

• 7AL Q 11.439 35.71* -19.470 
1 ML C4I 11.104 11.193 -19.143 

• 5«« * 13.441 14.311 -11.775 

• Sf » C 14*119 13.920 -11*945 
t SEA Cl 15.124 11.432 -19*105 

10 ti« If 14.115 13.417 -17*4*2 

10 1LJ C 12.417 11.117 -17*277 

11 M Cl 14.125 12*115 -15.410 
10 CLN CO 14.414 21.911 -13.147 

10 CL* «E2 14.152 10*940 -12.251 

11 XLE CA 10.373 11.904 -11.112 
" 1" 0 1.173 21.333 -21.110 
11 IL£ C61 1.44* 14*117 -11.149 

11 1LE COl 7.111 14.441 -17.923 

12 LIS CA 11.311 32.119 -21*722 
" O 10.171 12*713 -23.41* 
12 L7S CO 12.2*3 21.130 -21.425 
12 LIS Cf 13.123 27*4*7 -21.14* 
11 ALA * 10.199 14*131 -21.991 
11 ALA C 11.12* 31.71* -23.143 
11 ALA Cl 1.115 11.195 -21.5*5 
14 PIO CA 11.115 14.439 -21.120 
14 HO 0 11.771 11.147 -27*445 

14 MO CC 12.121 14.971 -23.221 

15 ALA m 11.141 34.234 -2*. 129 
" Ul C 11.112 31*795 -21*112 
11 ALA Cl 11.112 21.1*9 -27.142 
14 LEU CA 7.791 34.151 -27.12* 
14 VFU 0 7.142 24.12* -21*910 
*t LEO CC 1.711 11.445 -24.122 
14 LEU CM 4*414 32.217 -24.215 
17 IIS C* ••111 11*151 -21*131 
17 HIS 1 1.117 3I.A22 -11.15* 
17 HIS Ct 1.115 19.211 -2**2*2 
17 HIS C12 S.lll 31.924 -IS. 49* 
17 HIS mtz 4*179 39*321 -24.111 
11 111 CA 11*119 3**731 -31.122 
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>• 
ti 
i« 
it 
i* 
it 

IV 

II 

IS 

II 
tl 
II 
n 
ti 
ti 
it 

12 

ti 
ii 
ti 



in t 
in co 

Uk ft 
»L* C 
ftlft CI 
41* CD 

li« *i! 

tlV CA 
4Lt B 
TTl CI 
til 0 
TTt Ct 
TTt COI 
Tfi Cll 
Tit ft* 
TKt CA 
T*t C 
TM S«t 
*L* ft 

*n c 



I* 
14 
I* 
II 
II 
II 
If 
II 
It 

II 

II 

It 

IT 
II 
IT 
II 
II 
II 
II 
tl 
It 
II 
It 
It 



III ft 
lit C 

in CI 
AM * 
AM C 
AM Ct 

am eoi 

VAL M 
VAL C 
VAL CI 
VAL Cll 
ITS CA 
ITS 0 
LU Cfi 
LTl Ct 
VAL ft 
VAL C 
VAL Ct 
VAL CII 
At A CA 
ALA 6 
VAL tl 
VAL C 
VAL CI 



ii val tsa 

II 1LI CA 

II lit 0 

II ILI Cfil 

II 2Lt C01 

II A|» CA 

II Air 0 

11 At* Cfi 

II AM DDI 

II III CA 

II lit 6 

II fttt Ct 

9* 6LT CA 

14 *LT I 

II 1LI CA 

II ILI 6 

11 ILI Ctl 

ft SLf Ctl 

14 4!» CA 



It. lit 
II* HI 
I.IIC 
T.M1 
T.II1 
4.111 
1.S4J 
4. lit 
4*241 
4.111 
|**II 

1.071 

1.410 

1.111 

1*191 

4.242 

I.IIT 

4*311 

I. tit 
•0.117 
-I. Oil 
-1-113 
-I. 71* 
-1.010 

-i.nt 

•4.111 
-4.177 
-A.T12 
-1.T14 
-3. Ml 
•4.111 
"4.401 
-1.0*4 
-16.114 
-4.lt! 
-4. 71 1 
-2.124 
•1*417 
-J. 747 
-4.44* 
•4.01T 
•l.tll 
-1.114 
•1.011 
-I. Ill 
-1.111 
-7.1*1 
-1.417 
-1.0*4 
•4.117 
••.*« 
•1.111 
-1*111 
•1.TI4 
I. til 
-I. lit 

-1.144 
0.201 

-e.iiT 
•o.uo 

-0.042 
I* HI 



14.111 
11. Tit 
11.411 
14.111 
11.141 
11.767 
10. Ill 
11.117 
It. 174 

11.111 
11.174 
11.714 
14.7*4 
14.241 
14.141 
40.127 
41-721 
41*407 
40.211 
41.411 
41.147 
41.114 
41. 12C 
41.412 
42.17} 
41.127 
41.747 
42.4*0 
42.112 
40.101 
It. HI 
41.124 
41.171 
44.171 
41.417 
41.442 
42. VM 
42.444 
41.101 
44.110 
41.I4S 
41. Oil 
4|.40t 
41,110 
41.114 
44.144 
41.111 
41.107 
41.114 
44.447 
41*410 
41.702 
44.411 
41.107 
12*114 
IB. 021 
11.72* 
10.011 
12.411 
14.411 
10.210 
40.411 
01.411 



-12.171 
-11.171 

-11.141 

-12.102 
•12.200 
-11.111 - 
•10.21* 
•12.011 
-12.211 
-21*741 
-27.714 
•10.711 
-11*117 
-12.111 
•14.210 
-IT. lit 
-11.125 
-11.117 
•24*41] 
•21.111 
•27.171 
•27.144 
•21*120 
-27.111 
-24.201 
•21.709 
-11.011 
-21*2*2 
-22.111 
-22*021 
-10.011 
-II* IT! 
•10.411 
-22.410 
-21*117 
•11.209 
-14.021 
•17.012 
•11.171 
-14.411 
-11.104 
•11.072 
•10.411 

-I2.M* 

- -11.107 
•1.4T1 
-0.1V7 
-O.Ttt 
-t.TH 
-4.211 
•1.102 
•4.ITI 
-1.110 
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-1 5.945 


49 


ALA C» 


2.332 


47.051 


-13*154 


»13.914 


Tt 


4LT CA 


4.995 


44.905 


-14.470 


-15.021 


?• 


CLt 0 


7*4*4 


43.154 


•14.111 


-14.3 31 


71 


ca 


7.1TT 


43.019 


-14.444 


-15.943 


71 


Tort 0 


4*492 


41.920 


•14.495 


-13.191 


71 


7K4 0C1 


§.191 


42.592 


-12*390 
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71 


tKt 


CC2 


7.274 


44.583 


-13.394 


72 


VAL 


CA 


3.974 


4Z.441 


-14.484 


72 


VA1 


4 


4.341 


42.314 


-18*444 


72 


VAL 


CCS 


1*3X2 


42*444 


-17.174 


71 


AL* 


m 


4.43* 


44.417 


-17.3*8 


75 


AL* 


c 


4.433 


44.333 


-14.355 


73 


4L* 


C4 


3*147 


45*441 


-19*433 


74 


4L4 


CA 


7.474 


47.591 


-14.939 


74 


ALA 


4 


3*454 


44*440 


-21.054 


7S 


LEU 


■ 


7.454 


44*744 


-21*839 


75 


LEU 


C 


t.3§2 


44.544 


-22.444 


75 


L2U 


C4 


7.344 


40.471 


-22*809 


7* 


lcu 


CD1 


4.074 


52-434 


-22*390 


74 


ASM 


M 


4.147 


44*103 


-24,159 


74 


AS* 


001 


xe*t$o 


45.440 


-27.924 


74 


454 


C4 


it.aio 


44.451 


-25*994 


74 


45* 


C 


44.743 


49.948 


-25.443 


77 


ASH 


M 


It. 104 


49.444 


-25.071 


77 


ASM 


C 


13.707 


31*429 


-25.348 


77 


AS4 


C4 


31.333 


52*474 


-25.117 


77 


ASM 




12.032 


51*344 


-22.917 


71 


set 


M 


14.125 


52,2*7 


-25.154 


71 


set 


C 


13.414 


32.742 


-23*434 


7* 


set 


C4 


is. 90s 


53*44] 


-25*517 


79 


ILC 


m 


l4*4St 


52.545 


-22.529 


7t 


XLE 


c 


14*417: 


51.443 


-20*230 


7t 


XLE 


C4 


14.471 


54.174 


-26.497 


7* 


It E 


C42 


14.447 


55.320 


-21*412 


• ft 


4L7 


a 


14.4*5 


41*744 


-14.411 


40 


6Lf 


c 


14.412 


45*444 


-14.219 


• 1 


VAL 


II 


13.3X3 


48*744 


-17*930 


tl 


VAL 


c 


12. SIX 


44.919 


-19.217 
-14*477 


41 


VAL 


C4 


13.40X 


44*735 


11 


VAL 


CC2 


11-433 


47.241 


-14*231 


42 


Lf U 


CA 


11.312 


45.020 


-20.254 


12 


LEU 


0 


14*451 


43.334 


-14.400 


42 


LEU 


Cfi 


11.430 


43*548 


-22*354 


42 


LEU 


C02 


12.359 


42*475 


-23.142 


• 3 


417 


CA 


4.133 


43.321 


-19.1X4 


• 3 


6L7 


0 


4*444 


41.822 


-21*024 


4* 


VAL 


CA 


4*473 


39.807 


-19*814 


44 


VAL 


0 


4*424 


34.472 


-22*144 


44 


VAL 




4*410 


37.477 


-19.557 


as 


ALA 


a 


5.154 


40.924 


-21.024 


45 


ALA 


c 


4*213 


42*483 


-22.34* 


IS 


ALA 


C4 


2.444 


40.443 


•21.748 


44 


ttft 


CA 


5.413 


44.43S 


-23*245 


44 


FttJ 


0 


4.291 


44.403 


-2 3.849 


44 


rao 


C4 


7*430 


43*444 


-2 4.544 


47 


4ft 


II 


3*544 


44.474 


-24.744 


47 


&f I 


C 


1*103 


45.132 


-24.447 


47 


set 


CA 


2*401 


44.777 


-24.52? 


44 


ALA 


a 


1*4X7 


4.4.54* 


-23.742 


4* 


ALA 


CA 


-4.273 


44.953 


-23.044 


44 


ALA 


4 


-4.174 


44.717 


-22.435 


44 


set 


« 


-4,1*4 


47.102 


-24.280 


■ 4 


SI* 


CA 


-3.401 


44.447 


•22.227 


49 


set 


0 


•3*743 


45.844 


-20.209 


44 


LEU 


CA 


-2.371 


47.447 


-18.543 


44 


LEU 


0 


-3*542 


49.404 


-14.215 


44 


LEU 


C4 


-4*233 


47,851 


-17.174 


44 


LEU 


CD2 


1.140 


49.524 


-17.447 


41 


1ft 


CA 


-5.25* 


44.474 


-14.137 



71 


VAL 


It 


4,434 


42.487 


-11-42? 


JI 


*At 


C 


4.312 


43*484 


-17.431 


71 


VAL 


CA 


2.314 


42*44? 


-14.38* 


72 


VAt 


C62 


2.142 


42*327 


-14.723 


71 


At* A 


CA 


4*587 


45.491 


-19.1*7 


73 


ALA 


0 


9-442 


47.141 


•28.21* 


74 


ALA 


V 


4*344 


44.424 


•18.435 


74 


ALA 


C 


7.740 


47.448 


-23.342 


7* 


ALA 


ca 


4*453 


47.444 


-17.525 


75 


LEU 


CA 


7.312 


48*948 


-22.454 


75 


LEU 


0 


14.142 


43.754 


•22.253 


75 


LEW 


cc 


4.123 


31.913 


-22.373 


73 


LEU 


CB2 


5.494 


51*442 


-21*445 


74 


ASM 


8102 


32.385 


45.432 


-24*344 


74 


ASM 


cc 


11.143 


44.274 


•24.802 


74 


ASM 


CA 


14.359 


47*738 


-24.438 


74 


ASV 


D 


19.157 


49.479 


-24.419 


77 


ASM 


CA 


12.220 


58*957 


-25.481 


77 


ASH 


D 


14.354 


49.979 


-25*313 


77 


ASM 


CC 


11*250 


52.027 


-23*414 


77 


ASM 


M02 


14*294 


52.741 


-23*825 


74 


set 


CA 


15.513 


52.414 


-24*988 


78 


set 


D 


14*982 


53*971 


-21.144 


78 


set 


OC 


15*424 


53.470 


-24.4*4 


79 


ZLE 


CA 


15*155 


52.744 


•21*124 


79 


XLE 


0 


13.443 


55.441 


-24.479 


79 


XLE 


C42 


12.445 


44.412 


-24.814 


79 


XLE 


CC1 


12*135 


55.174 


-24.155 


44 


CLt 


CA 


14.474 


55*444 


-17*913 


44 


CLT 


0 


15*719 


41.994 


-18.5*4 


41 


VAL 


CA 


13.411 


47.244 


-18.4*1 


41 


VAL 


0 


12.240 


47.734 


-20.117 


41 


VAL 


CC1 


14.430 


47*444 


•15.573 


42 


LEU 


If 


12.124 


45.445 


-19*214 


42 


LEU 


C 


14.390 


44.428 


-19.510 


42 


Lty 


CI 


12.284 


44*219 


-21.223 


42 


LEU 


Cftl 


14. 744 


44.457 


-23.223 


43 


CLT 


M 


4.131 


44.140 


-19.814 


43 


CLT 


C - 


t*427 


41.411 


-14*923 


44 


VAL 


M 


7.272 


4L112 


•19.243 


44 


VAL 


c 


4.144 


48.830 


-21*148 


44 


VAL 


C4 


4.254 


38.920 


-14.841 


44 


VAL 


CC2 


7*190 


38*507 


-17.705 


45 


ALA 


CA 


4.217 


41.144 


-22.154 


55 


ALA 


1) 


1.240 


43.401 


-22*934 


64 


MO 


M 


4*240 


43.184 


•23.459 


44 




C 


4.321 


45.371 


-23.947 


4* 


rto 


£5 


4-122 


44.784 


-23.813 


44 


*40 


CO 


4*377 


42.4*0 


•23.434 


57 


4ft 


CA 


2.484 


41*324 


•25*529 


47 


set 


0 


4*142 


45.313 


-25*419 


47 


SEt 


OS 


3*991 


45.143 


-27.443 


84 


ALA 


C4 


-4.143 


43.514 


-21*428 


84 


ALA 


C 


-4.498 


43*717 


-22.440 


89 


SEt 


M 


-2.219 


49.*91 


-22.478 


49 


4€t 


C* 


-4.343 


41.433 


-22*848 


49 


set 


c 


-3. '134 


44.780 


-2«.?27 


40 


LEU 


■ 


-2*44* 


47.454 


-20.931 


40 


LEU 


c 


-3.433 


43.434 


-17.3*4 


90 


LEU 


C9 


-4*911 


48*273 


-18*42* 


94 


LfU 


C01 


-4.424 


45.34, 


-17*214 


91 


??t 


If 


*4*244 


47.944 


-14.938 


91 


7ft 


c 


-4*473 


44.734 


-14.485 
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41 TTt i 

*1 TTt tt 

41 TTt CD2 

41 ??■ CE2 
fl TTt 0* 

42 AL ■ CA 
4J ALA 0 

43 fit n 
93 TAt C 
93 V A L C9 

93 TAL C62 

44 LIS CA 
44 LTS C 
44 ITS C6 

44 LTS CE 
95 VAC M 
95 TAL C 

45 TAL CI 
45 4*L Ct2 

94 LEU CA 
44 LEU 0 
44 LEU CC 
44 LEU C02 
ST 0LT CA 
47 fcLT 0 
41 ALA C0 
41 ALA C 
94 ASP II 
94 ASP DDI 
91 ASP CS 
94 ASP C 

is: tLV A 
lift «L? £ 

lei sei a 

1C1 SEE C 

Itl SEE C» 

102 tt 
162 CLT C 

103 tl« fl 
103 0LI C 
103 CLR Ct 
103 tL« Ct» 

103 (IB «2 
114 TTt CA 
194 TTt 0 
194 TTt Ct 
1*4 TTt €02 

104 Tfl Ct 2 
104 TTt CHI 
IDS 1E1 CA 
ItS SEE * 

1#5 set 04 

194 TIP CA 

104 TtP 0 

194 TIP Ct 

194 TIP CD2 

194 TIF CE2 

204 TIP CZ2 

Iftt TIP CM2 

10T HE CA 

&QT 1LE 0 

19? IlE CC1 

19? UE C01 



-4.494 
-7.994 
-7.971 
-1.31* 
•9*192 
-4.*49 
-4.723 
-$•959 
-4. TCI 
-7.957 
•I. ITS 
-4.379 
-4.451 
-5*394 
-4.344 
-4.404 
-4.914 
-9.104 
-4.900 
-4. 742 
-J. 4*2 
-3.543 
-4.404 
-3.940 
-1.419 
-9*429 
0*194 
-9.S04 
-2.7*9 
-9*449 
9.144 
-4*424 
-1*920 
-2.542 
-4.750 
-1.T1A 
-9.421 
-4.144 
-9*377 
-10*443 
-11.471 
-12.349 
-13.419 
-12.449 
-12.439 
-11*429 
-10*379 
-4*392 
-9.411 
-14.977 
-14.754 
-15.204 
-32-421 
-12.421 
-11.445 
-19.459 
-11*354 
-14.471 
-4.293 
-19*145 
-11*095 
-9.434 
-4.2*3 



47.T44 

44.237 

49*175 

44.421 

4I.T52 

44.144 

99.990 

49*493 

49.914 

47.953 

47.374 

50.4*4 

50.440 

92.320 

54.201 

45-071 

49*494 

47.999 

44.140 

44.193 

51.121 

44.794 

44.012 

52.397 

51.443 

55*479 

53.139 

52.973 

SO. 492 

51.493 

54.145 

49.103 

47.451 

44.124 

47.994 

47.447 

4T.492 

44.534 

47.954 

45.232 

47.307 

40.104 

49.147 

65.124 

43.274 

40.029 

40.954 

40.057 

10.141 

45*144 

49.439 

47.434 

47.391 

44.449 

44.111 

49.412 

99.573 

91.319 

91*291 

44.250 

43.474 

43.794 

42*999 



-14*923 
-IT. 741 
-10.144 
-14.442 
-21.T44 
-12.747 
-12.050 
-11.124 
-1.549 
-10.411 
-12.472 
-4.449 
-5.703 
-5.44? 
-4*199 
-5.924 
-Z.54I 
-4.314 
-4.332 
-1.444 
-2.334 
-2.972 
-1.445 
0.297 
0.145 
- 1.519 
3.917 
2.912 
4.093 
5.175 
9.320 
2.140 
2.002 
2.401 
2.532 
4.917 
2.577 
2.529 
2*444 
2. 022 
3.274 
4*915 
4.112 
1.544 
-4. 447 
2.4T2 
1.940 
2-171 
3.324 

-0.OJ4 

-2*254 
1.450 
-1.940 
-4.245 
-9.294 
4.411 
1.541 
2.449 
2.455 
-3.125 
-5.344 
-1.434 
-0.427 



91 TTt C9 

91 7TB C01 
41 TTt CE1 

41 TTt C2 

92 &L« m 

42 ALA C 
42 ALA C9 
91 9AL C& 
91 9AL 0 

93 9AL CC1 
*4 ITS * 

94 ATS C 
94 LTS C0 
94 LTS CD 

94 ITS RZ 

93 VAL CA 
45 9AL 0 

95 9AI CC1 

94 LEU H 
94 LEU C 
94 LEU CA 
94 LEU CD1 
97 *LT ft 
47 CLT C 
41 ALA W 
91 ALA CA 
94 ALA 0 
99 ASP 002 
49 ASP CC 
99 ASP CA 
49 ASP G 

100 SLT CA 

190 GLT 0 

191 SEE CA 

101 SEE 0 

101 SEE OC 

102 fiLT CA 

192 CLT 0 

103 CLN CA 
101 0L9 ' 
103 CLN CC 

103 6LH OE1 
194 TTt 0 
194 TTt C 

104 TT1 C9 
104 TTt CD1 

194 TTt C51 

104 TTt C2 

105 SEE It 

195 SM C 
145 SEE C9 
104 TIP * 
104 TIP C 
104 TIP C4 
114 TIP C01 
194 TIP nei 
304 TI9 CIS 
144 TIP CI3 
19T ILE tt 
14? ILE C 
107 ILE C5 
107 ILE CC2 
109 1L! * 



-0.494 40.493 

-4.595 47.415 

-4*995 47.5T2 

-T.T94 40.142 

-4.895 49.954 

-5*923 99.433 

-1.997 91.421 

-7.143 41.154 

-4.191 47.993 

-§.215 47.418 

-0.947 54.217 

-7.331 49.995 

-4.951 51.914 

-4.944 93*725 

-3.73* 55.544 

-7.444 49.457 

-7.425 41*194 

-0.949 44*452 

-9.474 49.974 

-4.331 55*55* 

-3.599 41*241 

-2*207 44.114 

-4.324 59.975 

-2*143 52.43? 

-1.454 51.441 

-4.541 54.441 

1.193 52.921 

-2.431 91*942 

-2*993 51*131 

0.191 51.411 

9.735 49.113 

•0.343 49.521 

-1.449 44.512 

-1.542 47.101 

-4.754 44.472 

-4.411 41.434 

-7.477 47.422 

-7.181 45*431 

-10.315 44.247 

-15. 774 45.412 

-11.344 49.905 

-22.159 49.114 

-11.411 44.141 

-11*931 43.490 

-12.49? 41.944 

-11.419 14.719 

-10.409 14.115 

-9.544 19.422 

-13.C04 44.572 

-14.172 49.420 

-15.940 44.121 

-13.474 44.425 

-11.495 44*434 

-11.321 44.254 

-12.442 49.924 

-12.441 94.151 

-9. 2?5 49*492 

-4.444 99*443 

-11.114 45.119 

-11.955 01.54* 

-4.944 43*19r3 

-9.412 41.930 

-12.994 41.242 



-16.514 
-10.755 
-20.@5ft 
-20.443 
-14.104 
-11.903 
-12.4E0 
-14.325 
-4.372 
-9*725 
-4.121 
-5.494 
-4.311 
-5.542 
-4.397 
-1.920 
-1.501 
-9*414 
-2.494 
-1.121 
-1.573 
-2.143 
-9.954 
0.115 
0*754 
0.945 
1*443 
0.151 
5.049 
1*955 
4.929 
1*415 
1.474 
1.115 
1.407 
5.294 
1*044 
1.830 
1.429 
0.117 
4.594 
5.492 
2*451 
0*473 
2*143 
1*377 
3.T07 
1.401 
9.403 
-1.154 
0.401 
-9.13* 
-1*912 
-1.155 
9.244 
1.340 
0.574 
1.524 
-2*411 
-4.199 
-2.423 
-1.151 
-1.577 
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1M 
10B 
100 

ioo 

107 

IM 

lit 
ltt 

no 
no 
in 
in 
in 
in 

112 

112 

112 

112 

113 

113 

113 

113 

113 

111 

113 

114 

11* 

11* 

113 

115 

119 

115 

11* 

114 

117 

11? 

117 

117 

Hi 

Hi 
Hi 
110 
lit 
lit 
lit 
lit 

120 
220 
120 
12i 
121 
121 
121 
121 
122 
122 
122 
122 
123 
123 
123 
121 
124 



ZLC CA 
3LE 9 
ZLI CC1 
lit CO! 
ASH CI 

asm a 

43* CC 
ASM H02 
tL T CA 
Hi C 
IH CA 
ZLC • 
ILC C61 
ICC C01 
frLU CA 
CLU 0 
frLU CC 
0E1 

tip m 

TOP C 
TAP CA 
TAP C01 
TAP ME 1 
TAP CC3 
TAP C23 
ALA «f 
ALA C 
ALA Ci 
ZLC CA 
ZLC D 
ZLC CCl 
ILE CD1 
ALA CA 
ALA B 
ASM M 
ASM C 
ASM CO 
ASM DO 1 
ASM M 
ASM C 
ASM Cl 
ASM 001 
MET • 
Ml T C 
PET Ci 
Mlt SD 
ASP M 
ASP C 
ASP Ci 
ASP Ml 
t AL M 
VAL C 
7AL CA 
VAL CC2 
ZLC CA 
ZLC « 
1LC CCl 
ZLC C91 
ASM CA 
ASM ■ 
ASM CC 
ASM M2 
MET CA 



-14.314 
-14,194 
-14*72* 
-15*432 
-15.204 
-14.4*0 
-14.523 
•14,433 
-11.752 
-11.327 
-12.403 
-13.721 
-11*421 
-13.5*3 
-14.113 
-14***7 
-17.347 
-It. 041 
-35.074 
-14.074 
-13*312 
-24*1*3 
-13.397 
-31*451 
-20.410 
-13*017 
-13.177 
-11.297 
-23.070 
-14.077 
-15.210 
-14.004 
-17*370 
-12.323 
•15.423 
-13*027 
-13.415 
-14.545 
-14.223 
-12*2*0 
-14.2*7 
-14,510 
-11.404 
-10,025 
-7.010 
-A.7M 
-A. 70* 
-7.022 
-7.555 
-7,301 
-7.021 
-4S.274 
-+.75S 
-4.707 
-4.241 
-4.027 
-t.*t* 
-9.774 
-3.1*5 
-3*700 
-0**92 
-••3*4 
-3*430 



42.722 

43.320 

41.077 

40.045 

44.010 

44.272 

47.40(3 

40.447 

45.717 

44.727 

42.33* 

42.334 

40*301 

35.706 

43.174 

44.130 

42*917 

40.644 

45,403 

45,443 

47,553 

49.734 

50.443 

47,445 

47.179 

44,001 

43.179 

43*172 

41.440 

42,225 

39.034 

39,411 

44,4*0 

45,235 

45*370 

44.974 

44.751 

49*012 

43.725 

42.444 

42*043 

42*321 

*2.ses 

40*734 

42*441 

44.943 

40**37 

3*. 390 

39.354 

40*704 

57.117 

37*53* 

30.307 

37*914 

19.797 

30*012 

40.392 

39.700 

99,05* 

41.431 

40,0*0 

40*747 

39*973 



"**S21 
-4*552 
. -2,482 
-1,131 
-3.914 
-0.235 
-4.353 
-3.4*2 
-7.0*5 
-10.03* 
-9.077 
-11.1*0 
-7*455 
-4*334 
-10.0*4 
-12.2*4 
-0.135 
-0,014 
-30*971 
-13.1*0 
-11*434 
-12.401 
-13*723 
-13.907 
-14.377 
-12.132 
-1**752 
-13*1*0 
-14.037 
-17.070 
•13.0*3 
-11*7*3 
-14*050 
-10.343 
-17,122 
-19.034 
•17.424 
-17.773 
-10.947 
-17.1*3 
-21.279 
-20.759 
-10.475 
-10.921 
-17.055 
-17.325 
-19.50* 
-21.354 
-21.234 
-23.00* 
-10.115 
-15*71* 
-1 7.*94 
-30.044 
-13.397 
-12*449 
-13*041 
-12*313 
-11.232 
-9-033 
-10.777 
-9.720 
-7.434 



300 
309 
100 
109 
107 
109 
109 
110 
119 
111 
311 
111 
111 
112 
112 
312 
112 
112 
113 
113 
113 
113 
113 
113 
113 
11* 
11* 
115 
115 
115 
115 
114 
11* 
114 
117 
117 
117 
117 
111 
110 
110 

lit 

117 
117 
339 

119 
120. 
120 
120 
120 
121 
121 
121 
122 
322 
122 
122 
323 
123 
323 
123 
12* 
12* 



ILf C 
ZLI CO 
ILf Ct2 
ASM ft 
ASM C 
ASM CO 
ASM *3l 
617 M 
CLT C 
ILf n 

ILf C 
XLE CO 
UC CC2 
ALU M 
SLtl C 
GLU CO 
CLW CO 
tLti 022 
TOP CA 
TOP D 
TOP CO 
TOP CO* 
TAP CC2 
TIP CZ2 
TAP CHZ 
ALA CA 
ALA 0 
ZLC M 
XLC C 
XLC CO 
XLC CC2 
ALA M 
ALA C 
ALA Ci 
ASM CA 
ASM 0 
ASM CC 
ASM MD2 
ASM CA 
ASM 0 
ASM CC 
ASH M02 
MET CA 
ME7 0 
MET CC 
*ET C* 
ASP CA 
ASP 0 
ASP CC 
AS» 002 
7AL CA 
VAL 0 
VAL C61 
XLC M 
XLC C 
XLC CO 
XLC CC2 
ASM M 
ASM C 
ASM C3 
ASM 091 
MET M 
MST C 



-14.439 
-15*2*4 
-14.540 
-14.751 
-14.232 
-15*2*0 
-17.455 
-12*951 
-12.101 
-12.379 
-13.059 
-12*73* 
-13.122 
-1**873 
-13.072 
-17.227 
-15.724 
-17.123 
-I4.T34 
-14.319 
-33.494 
.-12.441 
-12.5*5 
-11*494 
-10.752 
-12.333 
-12.943 
-14.174 
-35.920 
-14.000 
-17.151 
-14.53* 
-14.704 
-11.011 
-14.553 
-12.977 
-14,400 
-14.931 
-13.740 
-11.417 
-13.737 
-10.134 
-10.232 
-10.000 
-9.000 
-9.902 
-0.410 
-0.030 
-0*237 
-9.327 
-0*224 
-4.214 
-3.751 
-4,310 
-5,020 
-7.474 
-7.221 
-4,243 
-3.102 
-1-021 
-0.003 
-3.450 
-2.423 



43*494 

42.245 

42.024 

44.950 

44.047 

47.359 

44*495 

45,900 

44.712 

43.533 

42.540 

40,9*0 

37.791 

43*075 

44.3*7 

43.097 

41.024 

41.720 

44.400 

45*732 

41.554 

40.952 

47*741 

30.0*5 

47,074 

44.045 

43.074 

42.3*0 

42.415 

40,0*0 

40.140 

43.527 

45.047 

45,510 

45.947 

45.434 

40.177 

*0.2*9 

42.442 

42.307 

43.0*0 

44*034 

42.222 

37.030 

43.003 

44.041 

39.110 

37.107 

37.730 

37,135 

30.401 

40.700 

30.174 

30*970 

39.242 

39.404 

39.003 

40.222 

40*404 

40.470 

31.990 

37*40* 

37.403 



-5-33* 
-3.320 
-4.093 
-4.781 
-7.184 
-3.207 
-4.444 
-4.774 
-0.012 
-0.244 
-7.7*2 
-0.344 
-9.347 
-3*200 
-11*171 
-9.141 
-0.415 
-9.0*4 
•12.000 
-14.332 
-22.411 
-13.*43 
-14.215 
-15.27* 
-15.403 
-13.074 
•15*970 
-1**117 
-15.034 
-13.922 
-14.755 
-15.247 
-17.270 
-15.151 
•10.137 
-19.020 
-14.939 
-13*734 
-17*032 
-20.932 
-21.395 
-22*133 
-10**70 
-10.757 
-14.302 
-10.2*3 
-20.130 
•10.470 
-22-*S* 
-22*733 
-14.974 
-15.909 
-14.*27 
-1*.370 
•12.427 
-12.44* 
-10.75A 
-12.110 
-9.041 
-11.497 
-11*010 
-0.132 
-4.41* 
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114 «!' S 

|I* HIT CC 

||4 BIT CI 

It* lit CA 

HI *»■ ■ 

111 IM *• 

II* IfV t* 

12* UU 9 

114 LIU CO 

It* LtV UX 

117 SIT t* 

lit CLf t 

111 ILf t* 

lit *LT 0 
til MC Cl 
lit MO 0 

HI tts « 

116 lit « 

H« IM C 

110 IM CI 

111 it* * 

111 *L1 C 
111 HI * 

in si« t 

111 If! Cl 

111 It* •} 

HI AL* C 

113 At * Ci 

114 Ail CA 
114 *U 0 
111 LIU H 
HI tlw C 
111 bib CA 
131 LtU CB1 
11* it! k 
11* itl C 
II* Ltl Cl 
11* 11* CO 
II* US HI 
11T AL* CA 
lit Al« 0 
111 *LA * 
111 AH C 
111 All Cl 
II* V4L CA 
111 *Al 9 

lit vn csi 

1*1 Alt M 

1*1 Al» C 

|40 Alt Cl 

1*£ Alt DDI 

1*1 Ltl « 

1*1 IT1 t 

141 lt| Cl 

141 ltl Ci 

1*1 ltl Hi 

1*2 ALA CA 

1*2 ALA 0 

|*| VAL * 

|41 V*L C 

1*1 V AL Cl 

|*1 V AL t&t 

1** ALA CA 



-J. 10* 
•A till 
-1.»40 
-1.1*1 
MM 
1.4** 

*1.**1 

-I.I** 

-l.wi 

-*.H* 

-i.ni 

-!»*** 
-4.*T| 
-4, til 
-4.411 
-A. Ill 
-A.Alt 
-1.0*1 
-t.lll 
-1*64* 

-icon 

-12.205 
-11-140 
-1S.2I! 
-14.1*0 
-14.147 
-11**10 
• 11. t** 
-IT. 172 
-14.111 
-11.471 

-14.111 

-13, CM 
-11**40 
•14.101 

•11.144 

*14.*03 
-11.011 
-11.141 
-It. ttl 
-IT. TO* 
-1**121 
-14.101 
•If. 122 
-12.14* 
•11.201 
-Ifi.tlt 

-I4.lt! 

•2A.821 
-14.141 
-14.171 
-14.411 
•14.111 
-11.111 
-ll.lt* 
-21. HI 
-I4.IT1 
-ll.TTO 
-11.112 
-14.14* 
-11.111 
-11.101 
-14.UA 



11.101 
**. Ill 
11.111 
41. til 
41.417 

*t.4l* 
41. lit 
fl. HA 
41.4*7 
A2.t*0 
11.IU 
It.ttO 
1T.41A 
11.1M 
14.121 
12.117 
14.11* 
11.011 
14.114 

11. Ill 
11.1*7 

14.113 

11.011 

14,101 
14.121 
S4.lt! 
14.1*1 
11.121 
17.211 
ST. Hi 
11.224 
14.001 
St. 121 

11.411 

1»«I21 
11.711 
12.1*1 
21. M2 
21- AU 

14.41* 
21.041 
14.101 
14.41* 
11.1*1 
SI.2H 
14.070 
SJ.ll* 
11.11* 
11.111 

11.141 

10.401 
S4.2A1 
SS.411 
1**271 
BI.U7 
40.011 
14.112 
11.1*1 
11.11* 
12.291 
11. *tl 
12.111 
11.11* 



-4.013 

-t ,471 

•T.142 
•I.T4* 
-1.101 

-mil 

-2.11* 
•2.12* 

•1.311 
•4.07J 
1.1*1 
2.120 
1.142 
1.21* 
I. til 
A. 101 
T.T21 
1.112 
4.12* 
1.21* 

4.141 

1.1*2 
4.114 
l.tl* 

1.1*1 

2*2** 
1.017 
1.104 
-O.T*2 
-2.1*9 
-2.94* 
-I. TO* 
-O.ttl 
-2.212 
-2.171 

-4.110 

-2.11* 
•2.114 
-4.140 
-4.113 
•1.201 
•l.Tlt 

-T.llt 
-1.114 
-T.tlt 

- -1.111 

-1.1*4 
-••122 
•10.014 
•l.tll 
-7*112 
-f.tll 
-11. t** 
•10.121 
-10. lit 
-10.211 
-12.il* 
• 14. tM 
-12.112 
-14.4*4 
-11.11* 
-12.01* 
•14.441 



12* «|7 Cl 
124 a|t 1? 
121 lit 4j 
111 11* c 
22* lit Cl 
11* UU * 
124 LIU C 
LtU Cl 
LIU C01 
SH N 
ALT C 
tLt « 
ALT C 
#10 * 
MO C 
MO Cl 

rto co 

It* Cl 



114 

11* 

121 
127 
121 
121 
lit 
121 
III 
lit 
210 



110 11* 0 

113 lit OC 

131 tLt CA 

112 fill 9 

113 It* C* 
112 ill 0 

112 1ft et 
313 All CA 

113 ALA 6 
134 AL* « 

114 AL I C 
114 ALA Cl 
111 LtV Cl 
111 LIU 0 
HI LtU CC 
111 LtU C02 

lit it* ca 

11* 111 c 

114 HI CC 

114 HI Cl 

111 ALA fcj 

117 ALA C 

117 ALA Cl 

1)1 All CA 

111 AL* e 

31* VAL fc 

lit TAt C 

lit VAL Cl 

H* VAL C02 

140 Alt CA 

140 AS* 0 
14? Sit 04 

i*o sit *ri 

141 ltl CA 
143 ltl 0 
141 ltl CO 

i*i iti ce 

|42 ALA H 

1*2 AL* C 

1*2 ALA Cl 

143 VAL CA 
141 f AL C 
141 VAL ttl 

144 AL* N 
144 ALA C 



-4.441 14.1*1 

-t.ll* 11.4T1 

-1.414 41.4*4 

.■,411 

1.121 M.I27 

•1.411 41.171 

-2.411 !♦*•** 

-l.tll 41.*** 

-1.211 41.111 

-2.122 21-112 

-1.11* fl.HO 

-4.121 H.**l 

•>4.4>* S4.ll* 

•4,11* 11.117 

-A. ill 14.111 

-4.0*0 14.41* 

-4.11* 1**110 

-1,470 14. AH 

-B.ttt IS. HI 

-1.T21 14. *1* 

-10.124 14.11* 

•12.4*1 14.121 

•14.407 11.431 

-14.1*1 JA.MA 

-14.4*1 si."* 

•17.191 14.1*1 

-17.141 14.411 

-IT. All l*»2ll 

-It. 131 17.1*1 

-11.2*3 11**00 

-14.1*1 SI- 244 

-11. 11* 14.110 

-11. All 11.1*0 

•10.112 14.101 

-14.141 11.117 

-11.21* 11**11 

-14*741 11.047 

-11.141 tl-701 

•14.744 14.240 

• It. HI 11-101 

•It.Bt* 14.*4l 

-14.001 11*111 

-14, *|l I4.IA1 

-is, tie 

-11.411 14.221 

-11.110 14.111 

-11. ttl Jiifto 

•11.214 12.411 

•14.010 11*11* 

-ii.iii io.**o 

•14.11* 10.112 

-1T.3T1 11.00* 

-14. TOO S1.J*I 

-11,114 11.014 

-20.1T2 

-11.1*1 11.1*1 

-IS. lit 11.010 

-11.110 M-MJ 

-is.!*! 

-14.14* »»•■** 

-11.100 \**\\\ 

-ii.iii 

.14.12* !*•*•* 



•A.81S 
•*.*S6 
-4*1*2 
**. 1 1 1 
-6. HI 
-1.TT1 
•1.10T 
•2.410 
-2.171 

•1.411 

2.412 
2.22! 

4.104 
1**02 
4.112 
7.314 
4.411 
4*021 
4.011 
C.4BI 
3*174 
4, Til 
S.IU 
4.12* 
1.171 
2.IZ4 
•1*01* 

1.2*4 
-1.474 
-1.111 

-1*10* 
•2.1*0 
-1.111 
-I. II* 
-1.111 
-1.101 
-1.1*3 
•1.771 
-1*1*1 
-4.»*1 
-4.241 
-4.4H 
-1.142 
•t.llT 
•I.T20 
-A.*** 
-4.212 
•l.tl* 
-31.1*0 
-7.10* 
• A.S21 
-11.14* 
•11.111 
-11.1*1 
•11.110 
-11.141 
-13.111 
•li.»*l 
•11**10 
-11. *1* 
-H.**l 
•11.111 
•11.1*1 
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144 *n i 

1*» SIP • 

1*1 tit c 

IM tn CI 

l«* «LT « 

1*4 4L9 C 

147 V4L H 

147 fit C 

t*T 9AL CI 

1*7 VAL CSI 

1*1 VAL CA 

141 VAL D 

141 941 Cftl 

144 VAL ft 

1*9 44L t 

t4f VAL t| 

I4t VAL tci 

111 VAL CA 

It! VAl 0 

lie Til €61 

111 UA ft 
lit ill C 
itS 4LI CI 

112 UA CA 
lit ALA C 
111 4lA IV 
ill tit C 
Stl ALA Ct 
1*4 « L t CA 

114 alt o 

lit ASK CA 

111 41* c 

111 MX £6 

111 ASM KB2 

lit CkV CA 

lit Ciu D 

lit im ct 

lit ttu on 

iit Sir ft 

in *it c 

111 Till « 

111 7M OH 

ill tut ci 

III T«ft 0 

2M lit 16 

111 Iff CA 

tit fit c 

lit tl7 CA 

lie tit t 
iti trt CA 
iti tit a 

111 Sit ft* 
142 Ml CA 
Iti Iti 0 

112 tti ec 
111 sit ca 
111 in o 

111 tit Ob 
114 VM CA 

144 ?«a e 
it* TMt eti 

111 VAL k 
III VAL C 



*l*.Jtfc 
-1A.10T 
-11. API 
-11.114 
•14. |TT 
-12.171 
-12. 110 
-9.216 
•11*192 
-U.140 
-7,*|2 
-4.145 
-1.079 
-T.211 
-1.700 
-4,124 
•1.414 
-1.111 
•2.192 
•I. IT| 
-2.144 
•1,010 
•3.117 
1.71* 
•1.711 
1.121 
• •111 

1.710 

1*143 

4.110 

1.144 
t.101 

1.196 

1.414 
4,113 
1.174 
2.411 
1.T44 
I. Ill 

4.161 

?• 14 7 

1.70 * 

t.112 

t.4Tf 

1*141 

4.121 

1.111 

l.4|* 

4. Ill 

2*414 

1.414 

2.114 

1.147 

1.112 

■•114 

-P. 411 

-1.ITI 

•l.tt! 
1.451 
1.411 
2.114 

-••111 

•2. tit 



12.143 
22.141 
l*.T73 
1A.1TA 
13.114 
24.491 

21.142 

1**134 

II. f »7 

17.111 

I*. 210 

21.113 

23.413 

14.111 

l4.lt! 

14. IIP 

II. 114 

14.117 

24.7TI 

14.411 

14.144 

21.911 

11.111 

11.411 

14.444 

ll.ltl 

12.721 

21.021 

24.211 

11.247 

14*717 

14.121 

24.712 

27.141 

12.117 

26.417 

22.442 

24.122 

21.117 

21.432 

27.713 

21.417 

24.417 

27.131 

21.904 

21.216 

22.211 

21.104 

22.124 

If. 777 

10.247 

11.021 

22.721 

21.140 

23.111 

24.720 

24.141 

11.711 

21.140 

16.162 

21.112 

21.741 

21.141 



-14,011 
-ll.TOi 
•17,121 
•14.414 
•17. f At 
•11.211 
•17.214 
-14.121 
-11, III 
-11.210 
•14.001 
-14.710 
•14.211 
•11.131 
•11.113 
•11. Ill 
-12.094 
-10,99] 
•9.490 
•11.441 
•1.191 
-4.417 
-4.167 
-1.111 
•2.447 
•2,912 
•1.111 
-2.191 
1.121 
•1.121 
2.017 
4.29S 
t. 100 
1.112 
4,970 
4.222 
4.141 
t.112 
4,227 
4.113 
1.912 
4.217 
1.712 
7*97? 
14.121 
I. Ill 
4.910 
t.tll 
4.113 
f .01* 
f.llf 
4.SI9 
T.til 
1.194 
9.410 
1.999 
1.164 
2,111 
4,112 
2.271 
1.492 
1.190 
1.497 



144 ALA tl 

145 $t* £| 
1*1 Ifl C 

l4t irt oft 

141 tlT CA 

14| ALT 0 

147 VAL CI 

147 VAL 0 

147 VAL C&l 

141 VAL * 

ui c 

141 VAL CI 

141 941 Ct2 

249 VAL CA 

149 941 0 

2*1 911 COl 

lit 941 If 

116 VAL C 

111 94L CI 

110 VAL C62 

111 ALI C4 

111 4L4 0 
212 All u 

112 ALA C • 
112 ALA CI 
111 ALA C4 
111 ALA 0 

114 ilt ■ 

114 ILT C 

111 A In ft 

111 AH C 

111 Alt CI 

HI 41V 001 

111 tLU If 

114 ALU e 

111 4LU CI 

114 Si.ii CO 

114 til/ 012 

117 tLT C4 

117 At* 0 

111 Tflt C62 

III 9wt Ct 

111 9*9 C 

219 til * 

119 Sit 09 

111 Id C 
140 ALT A 

140 tLT C 

iti lit k 

141 lit £ 

141 lit CI 

112 lit H 

142 lit C 
112 lit CI 
111 111 ti 
Iti lit C 
111 lit Of 
144 TM ft 
14* Tut C 
114 THI CI 
It* T«l CC2 
111 9AL CA 
111 9Ai 0 



•17.442 
•14.412 
-11.910 
-II. M2 
"11.419 
•11.420 
-10.174 
•11.171 
-9.191 
-I. Ill 
-7.11T 
•1.273 
•4,190 
•4.91? 
-1.424 
-7.193 
-4.712 
-1.117 
-2,274 
•I. 47| 
-2.241 
-0.411 
•0.490 
• •204 
1.244 
f ,140 
4.217 
1.127 
I. 119 
1.911 
1.199 
4.011 
4.121 
4.711 
1.132 
2.261 
2.294 
2.194 
7.104 
1.414 
• •079 
7.144 
4.110 
1.111 
1.47] 
4,444 
1.174 
4.4t4 
1.121 
2.477 
1.244 

1.102 

4.410 
•0.211 
•4.479 
•4.441 
-1.190 

4.127 

••III 

2.491 

1.297 
-4.919 
•2.939 



11.9*1 

24.917 

21.111 

14.911 

21.799 

24.114 

21.114 

21.991 

17.001 

21.911 

14.967 

14,124 

11.432 

14.941 

11.171 

11.419 

11.201 

21.111 

11.111 

14.143 

SI. All 

21.119 

11.917 

24.226 

14.407 

12.211 

22.192 

11.191 

24.049 

14,711 

14,211 

11.190. 

24.141 

13.141 

11.321 

11.900 

11.411 

14,414 

29,917 

11.144 

11.194 

11.144 

24.410 

41*441 

24.111 

12.726 

11.947 

12.041 

10.111 

11.701 

21.291 

11.9*1 

11.112 

21.444 

12.922 

14.177 

14.442 

24.912 

11.214 

11.111 

27.410 

29,142 

11.111 



-11.711 
•14.714 
-11.193 
•11.149 
-21.171 
•29.144 
•14.912 
-11.414 
•11.171 
•14.413 
•14.901 
•14.910 
•11.242 
•12.249 
•11.429 
-11,011 
-11,404 
-4.111 
•21.911 
•11.101 
-7.217 
-4.914 
-1.112 
•4.111 
•4.294 
•1.943 
•0,199 
•1.244 

4.120 
1. 141 

2.442 

2.904 
•1.124 

1.171 

1*111 

1*110 

4.271 

7.144 

4.117 

4.019 • 

1.119 

1.294 

7.117 

7.497 

9,112 

4.444 

I. Ill 

7.711 

4.111 

4*714 

7.371 

7.499 

1.1*1 

1.242 

2.197 

4.111 

1.111 

1.112 

1,194 

4,111 

4.411 

1,011 

1.210 
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lit 


■ 1 1 

T*l 




• . ft * * 

•1. lit 


I 1.42* 


-1.4 


lit 


*■ L 


f* E ■ 

til 


•9.21 1 


17. tl * 


•4*4 


* A * 

iff 




» ■ 


*1*4*1 


22 .7 71 


2*4 


li* 


Wh * 


m 
V 


— *.I2* 


92.114 


•9.9 


!• 7 


lift 


t ft 


■**• 1J1 


ft*. 14* 


9*1 


11* 


1 * ■ 


ft 
V 


—1*41* 
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The above structural studies together with the kinetic 
data presented herein and elsewhere (Philipp, M. , et 
al . (19B3) ml -SSII^ met***. 51, .5-32; Svendsen, 

I.E. (1976) -Carlsbezg Res. poamu 41, 237-291; 

Maryland, S.F. ' 3£; Stauffe,, D.C., et al. (1965) L. 
m 0 l. Ohem. 241,-5333-5338) indicate that the subsites 
in the binding cleft of subtilisin are capable of 
interacting with substrate amino acid residues fro*. 
p-4 to P-2 ' . 

The most extensively studied of the above residues are 
Glyl66, Glyl69 and Alal52. These amino acids were 
identified as residues within the s-1 subsite. As 
seen in rig. 3, which is a stereoview of the S-1 
subsite, Glyl66 and Glyl69 occupy positions at the 
bottom of the S-1 subsite, whereas Alal52 occupies a 
position near the top of S-1, close to the catalytic 
Ser221. 

All 19 amino acid substitutions of Glyl66 and Glyl69 
have been made. As will be indicated in the examples 
which follow, the preferred replacement amino acids 
for Glyl66 and/or Glyl69 will depend on the specific 
amino acid occupying the P-l position of a given 
substrate. 

The only substitutions of Alal52 presently made and 
analyzed comprise the replacement of Alal52 with Gly 
and ser. The results of these substitutions on P-l 
specificity will be presented in the examples. 

In addition to those residues specifically associated 
with specificity for the P-l substrate amino acid, 
T yrl04 has been identified as being involved with P-4 
specificity. substitutions at Phel89 and Tyr217, 



35 



0251 446 

-47- 

however, are expected to respectively effect M - and 
p-l f specificity. 

The catalytic activity of subtilisin has 

»odified by si^le «i«o acid substitutions at Asnlo5. 

The catalytic triad of subtilisin is shovn m Fig- 4. 

. n Ser?£ i f His64 and Asp32 arc positioned 

As can be seen, Ser2«.i, ...-ri-n. 
to facilitate nucleophilic ettech by tee »r » 
hydoxylate on tee csrbonyl =f tee £f ^ 

JL. Crystsllographic studies of * - 

(Kobertus. *. *!• Ei9^ 

Bh ow teat two hydrogen bonds ere forms* „Ji 
Lyanion of tee- substrate transition state One. 
hydrogen bond donor is from tee catalytic «°"»*-221 
.lin-chain amide while the other is fro. on, of -he 
HE 2 protons of the esparagine-155 side chain. 
Fig. 4. 

AsnlSS was substituted with Ala, Asp, His, Glu and 
Thr These substitutions were made to investigate tee 
Te Ttebilisation of tee charged Ut^M 

interests of the transition stete eemp!., c by J. 

.otentiel hydrogen bond between 

£Z£ rtest^r. luTd \erge decreases in 
pa " 1G „„.. ,,„ 0 to 4,ooo fold), marginal 

substrate turnover, kcat <200 to «,™ ,'„,„, „ d 
decreases in substrate binding Km (up to 7 f old, , and 
. loss in transition state stabilisation energy of 2.2 
to 4 7 fcoal/nol. The retention of Km and the drop in 
; c at will »U these mutant enzymes useful as binding 
Trotelns for specific peptide seguenoe. tee nacur.^f 

which will be determined by the specificity of 

precursor protaase. 
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Various other amino acid residues have been identified 
which affect alkaline stability. In BOffle cases , 
mutants having altered alkaline stability also have 
altered thermal stability. 

In S ^vloliquef ac-fons subtilisin residues Asp36 
11-107, Eysl70, Ser204 and Lys213 have been identified 
as residues which upon substitution with a different 
amino acid alter the alkaline stability of the mutated 
w enzyme as compared to the precursor enzyme. The 
substitution of Asp3 6 with Ala and the substitution of 
Lysl70 with Glu each resulted in a mutant enzyme 
having a lower alkaline stability as compared to the 
wild type subtilisin. When Iiel07 was substituted 
with Val, Ser2 04 . substituted with Cys, Arg or Leu or 
Lys213 substituted with Arg, the mutant subtilisin had 
a greater alkaline stability as compared to the wild 
type subtilisin. However, the mutant Ser204P 
demonstrated a decrease in alkaline stability 
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In addition, other residues, identified as being 
associated with the modification of other properties 
of subtilisin, also affect alkaline stability. These 
residues include Ser24, MetSO, cimss, 61yi66, Glyi 69 
and Tyr2l7. Specifically the following particular 
substitutions result in an increased alkaline 
stability: Ser24C, MetSOF, Glyl56Q or S, Glyl66A, H 
K, N or Q, Glyi69S or A, and Tyr217F, K, R or L. The 
mutant KetSOV, on the other hand, results in a 
decrease in the alkaline stability of the mutant 
subtilisin as compared to wild type subtilisin. 

Other residues involved in alkaline stability based on 
the alkaline stability screen include Aspl97 and 
Met222. Particular mutants include Aspi97(R or A) and 
Met 222 (all other amino acids). 
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Various other residMes have been identified as being 
involved in thermal stability as determined by the 
thermal {stability screen herein* These residues 
include the above identified residues which effect 
alkaline stability and Metl99 and Tyr21. These latter 
two residues are also believed to be important for 
alkaline stability. Mutants at these residues include 
119 9 and F21> 

The amino acid saguence of amyloliauef aciens 

substiliein has also been modified by substituting two 
or more amino acids of the wild-typs sequence. Six 
categories of multiply substituted mutant subtilisin 
have beon identified- The first two categories 
comprise thermally and oxidatively stable mutants. 
The ne>:t three other categories comprise mutants which 
combine the useful properties of any of several single 
mutations of ]3* amyloliquef aciens subtilisin. The 
last category comprises mutants which have modified 
alkaline and/or thermal stability. 

The first category comprises double mutants in vhich 
two cysteine residues have been substituted at various 
amino acid residue positions within the subtilisin 
molecule* Formation of disulfide bridges between the 
two substituted cysteine residues results in mutant 
subtilisins with altered thermal stability and 
catalytic activity. These mutants include A21/C22/C87 
and C24/C87 which will be described in more detail in 
Example 11. 

The second category of multiple subtilisin mutants 
comprises mutants which are stable in the presence of 
various oxidizing agents such as hydrogen peroxide or 
peracids* Examples 1 and 2 describe these mutants 
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Which include F50/I124/Q222 , F50/I124, F50/Q222, 
F50/L124/Q222, I124/Q222 and L124/Q222. 

The third category of multiple subtilisin mutants 
comprises mutants with substitutions at position 222 
combined with various substitutions at positions 166 
or 169. These mutants, for example, combine the 
property of oxidative stability of the A222 mutation 
with the altered substrate specificity of the various 
166 or 169 substi tut ions. Such multiple mutants 
include A166/A222, A166/C222, F166/C222, K166/A222, 
K166/C222, V166/A222 and V166/C222. The K166/A222 
mutant subtilisin, for example, has a kcat/Km ratio 
which is approximately two times greater than that of 
the single A222 mutant subtilisin when compared using 
a substrate with phenylalanine as the P-l amino acid. 
This category of multiple mutant is described in more 
detail in Example 12. 

The fourth category of multiple mutants combines 
substitutions at position 156 (Glu to Q or S) with the 
substitution of hys at position 166. Either of these 
single mutations improve enzyme performance upon 
substrates with glutamate as the P-l amino acid. When 
these single mutations are combined, the resulting 
multiple enzyme mutants perform better than either 
precursor. See Example 9. 

The fifth category of multiple mutants contain the 
substitution of up to four amino acids of the B* 
amyloliguefaclens subtilisin sequence. These mutants 
have specific properties which are virtually identicle 
to the properties of the subtilisin from £. 
lichenif oralis . The subtilisin from B. .lichenif ormis 
differs from amvloliouef aeiens . subtilisin at 87 out 
of 275 amino acids. The multiple mutant 
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F50/S156/A169/L217 was found to have similar substrate 
specificity and kinetics to the lichenif ormis enzyme. 
(See Example 13.) However, this is probably due to 
only three of the mutations (S156, A169 and L217) 
which are present in the substrate binding region of 
the enzyme. It is quite surprising that, by snaking 
only three changes out of the 87 different amino acids 
between the sequence of the two enzymes, the. £. 
amyloliquifaciens enzyme was converted into an enzyme 
with properties similar to J3. lichen if ormis enzyme* 
Other enzymes in this series include 
F50/Q156/N166/L217 and F50/S156/L217 • 

The sixth category of multiple mutants includes the 
combination of substitutions at position 107 (lie to 
V) with the substitution of Lys at position 213 with 
Arg, and the combination of substitutions of position 
204 (preferably Ser to C or L but also to all other 
amino acids) with the substituion of Lys at position 
213 with R. Other multiple mutants which have altered 
alkaline stability include Q156/K166, S156/N166, 
S156/K1S6, S156/N1S6 (previously identified as having 
altered substrate specificity) , and F50/S156/A169/L217 
(previously identified as a mutant of J3. 
amvloliouif aciena subtil isin having properties similar 
to subtil isin from B, jl j.chenifoymis ) . The mutant 
F50/V107/R213 was constructed based on the observed 
increase in alkaline stability for the single mutants 
F50, V107 and R213* It was determined that the 
V107/R213 mutant , had an increased alkaline stability 
as compared to the wild type subtilisin* In this 
particular mutant, the increased alkaline stability 
was the result of the cumulative stability of each of 
the individual mutations. Similarly, the mutant 
F50/V107/R213 had an even greater alkaline stability 
as compared to the V107/R213 mutant indicating that 
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the increase in the alkaline stability due to the F50 
mutation was also cumulative. 

Table IV summarizes the multiple mutants which have 
been made including those not mentioned above. 

5 - 

In addition, based in part on the above results, 
substitution at the following residues in subtilisin 
is expected to produce a multiple mutant having 
increased thermal and alkaline stability: Ser24, 

10 Met50, Ilel07, Glul56, Glyl66, Glyl69, Ser204, Lys213, 
Gly215, and Tyr217. 
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Double Mutants 

C22/C87 

C24/C87 

V45/V4B 

C49/C94 

C49/C95 

C50/C95 

C50/C110 

F50/I124 

F50/Q222 

I124/Q222 

Q156/D166 

Q156/K166 

Q156/N166 

S156/D166 

S156/K166 

S156/N166 

S156/A169 

A166/A222 

A166/C223 

F166/A222 

F166/C222 

K166/A222 

K166/C222 

V166/A222 

V166/C222 

A169/A222 

A169/A222 

A169/C222 

A21/C22 



TABLE IV 

Triple, Quadruple 
- or other Multiple 

F50/1124/Q222 

F50/L124/Q222 

F50/L124/A222 

A21/C22/C87 

F50/S156/N166/L217 

F5 0/Q1 5 6/N1 6 6/L2 17 

F50/S156/A169/L217 

F50/S156/L217 

F50/Q156/K166/L2 17 

F50/S156/K166/L217 

F50/Q156/K266/K217 

F50/S156/K166/K217 

F50/V107/R213 

[S153/S156/A158/G159/S16p/Alfil- 

164/X16S/S166/A169/3R17 0] 

L204/R213 

R213/204A, £, Q, D, N, G , K, 
V, R, T, P, 1/ K r F, Y, W 
or H 

V107/R213 



In addition to the above identified amino acid 
residues, other amino acid residues of subtilisin are 
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also considered to be important with regard to 
substrate specificity. Mutation of each of these 
residues is expected to produce changes in the 
substrate specificity of subtilisin. Moreover, 
multiple mutations among these residues and among the 
previously identified residues are also expected to 
produce subtilisin mutants haying novel substrate 
specificity. 

particularly important residues are His67, Ilel07, 
10 Leul26 and Leul35. Mutation of His67 should alter the 
S-l' subsite, thereby altering the specificity of the 
mutant for the P-l' substrate residue i Changes at 
this position could also affect the pH' activity 
profile of the mutant. This- residue was identified 
based on the inventor's substrate- modeling; from 
product inhibitor complexes. 

Ilel07 is involved in P-4 binding. Mutation at this 
position thus should alter specificity' for the • P-4 
substrate residue in addition to the observed effect 
on alkaline stability, Ilel07 was also identified by 
molecular modeling from product inhibitor complexes. 
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The S-2 binding site includes the Leul26 residue. 
Modification at this position should therefore affect 
P-2 specificity. Moreover, this residue is believed 
to be important to convert subtilisin to an amino 
peptidase. The pH activity profile should also be 
modified by appropriate substitution. These residues" 
were identified from inspection of the refined model, 
the three dimensional structure from modeling studies. 
A longer side chain is expected to preclude binding of 
any side chain at the S-2 subsite. Therefore, binding 
would be restricted to subsites S-l, S-1T , .S-2 S-3 ' 
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and cleavage would be forced to <?ccur after the amino 
terminal peptide. 

Leul3 5 is in the S~4 subs its and if mutated should 
alter substrate specificity for P-4 if mutated, This 
5 residue was identified by inspection of the 
three-dimensional structure and modeling based on the 
product inhibitor complex of F222. 

In addition to these sites, specific amino acid 
10 residues within the segments 97-103, 126-129 and 
213-215 are also believed to be important to substrate 
binding • 

Segments 97-103 and 12 6-129 form an antiparallel beta 
i5 sheet with the main chain of substrate residues P-4 
through P-2 - Mutating residues in those regions 
should affect the substrate orientation through main 
chain (enzyme) - main chain (substrate) interactions, 
since the main chain of these substrate residues do 
2o not interact with these particular residues within the 
S-4 through S-2 subsites. 

Within the segment 97-103,. Gly97 and Asp99 may be 
mutated to alter the position of residues 101-103 

25 within the segment. Changes at these sites must- be 
compatible, however. t In B. arayloliouif aciens 
subtilisin Asp99 stabilizes a turn in the main chain 
tertiary folding that affects the direction of 
residues 101-103. g. licheniformis subtilisin Asp97, 

30 functions in an analogous manner. 

In addition to Gly97 and Asp99, EerlOl interacts with 
Asp99 in S- ftmyliquef aciens subtilisin to stabilize 
the same main chain turn. Alterations at this residue 
„ should alter the 101-103 main chain direction. 
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Mutations at Glul03 are also expected to affect the 
101-103 Bain chain direction. 

The side chain of Glyl02 interacts with the substrate 
P-3 amino acid- Side chains of substituted amino 
5 acids thus are expected to significantly affect 
specificity for the P-3 substrate amino acids. 

All the amino acids within the 127-129 segment are 
considered important to substrate specificity. Gly 
10 127 is positioned such that its side chain interacts 
with the S-l and S-3 subsites. Altering this residue 
thus should alter the specificity for P-l and P-3 
residues of the substrate. 

The side chain of Gly 12? comprises a part of both the 
S-2 and S-4 subsites. Altered specificity for P-2 and 
p-4 therefore would be expected upon mutation. 
Moreover, such mutation may convert subtilisin into an 
amino peptidase for the same reasons substitutions of 
Leul26 would be expected to produce that result. 

The Prol29 residue is likely to restrict the 
conformational freedom of the sequence 126-133 , 
residues; which may play a major role in determining 
P-l specificity* Replacing Pro may introduce more 
flexibility thereby broadening the range of binding 
capabilities of such mutants. 

The side chain of Lys213 is located within the S-3 
subsite. All of the amino acids within the 213-215 
segment are also considered to be important to 
substrate specificity. Accordingly, altered P-3 
substrate specificity is expected upon mutation of 
this residue. 
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The Tyr214 residue does not interact with substrate 
but is positioned such that it could affect the 
conformation of the hair pin loop 2 04-217* 

Finally, mutation of the G2y215 residue should affect 
the S-3 1 subsite, and thereby alter P-3 " specificity. 

In addition to the above substitutions of amino acids, 
the insertion or deletion of one or more amino acids 
within the external loop comprising residues 152-172 
may also affect specificity. This is because these 
residues may play a role in the "secondary contact 
region" described in the . model of streptoiayces 
subtilisin inhibitor coxaplexed with subtilisin. 
Hirono, et al. (1984) J, Hoi. Biol, 17 3 , 389-413. 
Thermitase K has a deletion in this region, which 
eliminates several of these "secondary contact" 
residues. In particular, deletion of residues 161 
through 164 is expected to produce a mutant subtilisin 
having modified substrate specificity. In addition, a 
rearrangement in this area induced by the deletion 
should alter the position of many residues involved in 
substrate binding, predominantly at F-l. This, in 
turn, should affect overall activity against 
proteinaceous substrates. 

The effect of deletion of residues 161 through 164 has 
been shown by comparing the activity of the wild type 
(WT) enzyme with a mutant enzyme containing this 
deletion as well as multiple substitutions (i.e., 
S153/S156/A158/G159/S160/A161-164/I165/S166/A169/ 
K170) . This produced the following results: 
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kcat Km kcat/Km 

W T 50 1.4xl0" 4 3.6xl0 5 

Deletion mutant 8 S.OxlCT 6 1.6xl0 6 



The WT has a kcat 6 times greater than the deletion 
mutant but substrate binding is 2 8 fold tighter by the 
deletion mutant. The overall efficiency of the 
10 deletion mutant is thus 4.4 times higher than the WT 
enzyme • 



All of these above identified residues which have yet 
to be substituted , deleted or inserted into are 
15 presented in Table VI. 



TABLE VI 
Substitution/Insertion/Deletion 

20 

Residues 
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His 6 7 


Alal52 


Leul26 


Alal53 


Leul35 


Glyl54 


Gly97 


Asnl55 


Asp 9 9 


Glyl56 


SerlOl 


Glyl57 


Glyl02 


Glyl60 


Glul03 


Tlirl58 


Leul2 6 


Serl59 


Glyl27 


Serl61 


Glyl28 


Serl62 


Prol29 


Serl63 


Tyr214 


Thrl64 


Gly215 


Vall65 


Glyl66 


Glyl69 


Tyrl67 


Lysl70 


Prol68 


Tyrl71 




Prol72 
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The following disclosure is intended to serve as a 
representation of embodiments herein, and should not 
be construed as limiting the scope of this 
application. These specific examples disclose the 
construction of certain of the above identified 
mutants. The construction of the other mutants, 
however, is apparent from the disclosure herein and 
that presented in EPO Publication No. 0130756. 

All literature citations are expressly incorporated by 
reference . 



EXAMPLE 1 



Identification of Peracid Oxidizable 
15 Residues of Subtilisin Q222 and L222 

As shown in Figures 6A and 6B, organic peracid 
oxidants inactivate the mutant subtilisins Met222t and 
Met222Q (L222 and Q222) . This example describes the 
identification of peracid oxidizable sites in these 
20 mutant subtilisins. 



First, the type of amino acid involved in peracid 
oxidation was determined. Except under drastic 
conditions (Means, G.E., et al. (1971} Chemical 
Modifications of Proteins , Holden-Day, S*F,, CA, 
pp, 160-162} , organic peracids modify only methionine 
and tryptophan in subtilisin. Difference spectra of 
the enzyme over the 250nm to 350nm range were 
determined during an inactivation titration employing 
the reagent, diperdodecanoic acid (DPDA) as oxidant. 
Despite quantitative inactivation of the enzyme, no 
change in absorbance over this wavelength range was 
noted as shown in Figures 7A and 7B indicating that 
tryptophan was not oxidized. Fontana, A. , et al. 
(19 8-0) Methods in Peptide and Protein Sequence 
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Analvsis (C. Birr ed. } Elsevier, New York, p- 309. 
The absence of tryptophan modification implied 
oxidation of one or more of the remaining methionines 
of B. amyloliguefaciens subtilisin. See Figure 1. 

5 To confirm this result the recombinant subtilisin 
Met222F was cleaved with cyanogen bromide (CKBr) both 
before and after oxidation by DPDA. The peptides 
produced by CKBr cleavage were analyzed on high 
resolution SDS-pyridine peptide gels (SPG) . 

10 

Subtilisin Met222F (F222) was oxidized in the 
following manner. Purified F222 was resuspended in 
0.1 M sodium borate pH 9.5 at 10 mg/ml and was added 
to a final concentration of 26 diperdodecanoic acid 

15 (DPDA) at 26 mg/ml was added to produce an effective 
active oxygen concentration of 30 ppm. The sample was 
incubated for at least 30 minutes at room temperature 
and then quenched with 0.1 volume of 1 M Tris pH 8.6 
buffer to produce a final concentration of 0.1 M Tris 

20 pH 8.6). 3mM phenylmethylsulfonyl fluoride (PMSF) was 
added and 2.5 ml of the sample was applied to a 
Pharmacia PD10 column equilibrated in 10 mM sodium 
phosphate pH 6.2, 1 mM PMSF. 3.5 ml of 10 mM sodium 
phosphate pH6.2, lmM PMSF was applied and the eluant 

25 collected. 



F222 and DPDA oxidized F222 were precipitated with 9 
volumes of acetone at -20°C. The samples were 
resuspended at 10 mg/ml in 8M urea in 88% formic acid 

30 and allowed to sit for 5 minutes. An equal volume of 
200 mg/ml CKBr in 8 8% formic acid was added (5 mg/ml 
protein) and the samples incubated for 2 hours at room 
temperature in the dark. Prior to gel 

electrophoresis, the samples were lyophilized and 

35 resuspended at 2-5 mg/ml in sample buffer (1% 
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pyridine/ 5% NaDod30 4 , 5% glycerol and bromophenol 
blue) and disassociated at 95°C for 3 minutes. 

The samples were electrophoresed on discontinuous 
polyacrylamide gels (Kyte, J., et al. (1983) 
5 Anal, Bioch, 133 f 515-522). The gels were stained 
using the Pharmacia silver staining technique 
(Sammons, D.T--. # et al. (1981) Electrophoresis 2 
135-141) . 

10 The results of this experiment are shown in Figure 8. 
As can be seen, F222 treated with CNBr only gives nine 
resolved bands on SPG- However, when F222 is also 
treated with DPDA prior to cleavage, bands X, 7 and 9 
disappear whereas- bands 5 and 6 are greatly increased 

15 in intensity. 

In order to determine which of the methionines were 
effected, each of the CNBr peptides was isolated by 

2o reversed phase HPLC and further - characterized . The 
buffer system in both Solvent A (aqueous) and Solvent 
B (organic) for- all HPLC separations was 0.05% 
triethylamime/trifloroacetic acid {TEA-TFA) , In all 
cases unless noted, solvent A consisted of 0.05% 

25 TEA-TFA in H 2 0, solvent B was 0.05% TEA-TFA in 
a -propanoic and the flow rate was 0,5 ml /minute. 

For HPLC analysis, two injections of 1 mg enzyme 
digest were used. Three samples were acetone 
3 q precipitated, washed and dried. The dried 1 mg 
samples were resuspended at 10 mg/ml in 8M urea, 8S% 
formic acid; an equal volume of 20 0 mg/ml CNBr in 8B% 
formic acid was added (5 mg/ml protein) . After 
incubation for 2 hours in the dark at room 
temperature, the samples were desalted on a 0 . 8 cm X 7 



0251446 

-62- 

cm column of Tris Acryl GF05 coarse resin (IBF, Paris, 
France) equilibrated with 40% solvent B, 60% solvent 
A. 200 ul samples were applied at a flow rate of 1 ml 
a minute and 1.0-1.2 ml collected by monitoring the 
absorbance at 280nm- Prior to injection on the HPLC, 

5 each desalted sample was diluted with 3 volumes of 
solvent A. The samples were injected at 1.0 ml/min (2 
minutes) and the flow then adjusted to 0.5 ml/min 
(100% A) - After 2 minutes, a linear gradient to 60% E 
at 1,0% B/min was initiated. From each 1 mg run, the 

10 pooled peaks were sampled (50ul) and analyzed by gel 
electrophoresis as described above. 

Each polypeptide isolated by reversed phase HPLC was 
further analyzed for homogeneity by SPG . The position 
15 of each peptide on the known gene sequence (Wells, 
J. A., et al. (1983) Nucleic Acidfe Res. 11 7911-7924) 
was obtained through a combination of amino acid 
compositional analysis and, where needed, amino 
terminal sequencing. 

20 

Prior to such analysis the following peptides were to 
rechromatographed. " 

1. CNBr peptides from F222 not treated with DPDA: 

25 

Peptide 5 was subjected to two additional reversed 
phase separations. The 10 cm C4 column was 
equilibrated to 80%A/ 20%B and the pooled sample 
applied and washed for 2 minutes. Next an 0.5% ml 
30 B/min gradient was initiated. Fractions from this 
separation were again rerun, this time on the 25 cm C4 
column, and employing 0.05% TEA-TFA in 
acetonitrile/l-propanol (1:1) for solvent B. The 
gradient was identical to the one just described. 

35 
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Peptide "X" was subjected to one additional separation 
after the initial chromatography. The sample vas 
applied and washed for 2 minutes at 0.5ml/min (100%A), 
and a 0.5% ml B/min gradient was initiated. 

Peptides 7 and S were rechroma tographed in a similar 
manner to the first rerun of peptide 5. 



Peptide 8 was purified to homogeneity after the 
initial separation - 

10 

2. CNBr Peptides from DPDA Oxidized F222: 

Peptides 5 and 6 from a CNBr digest of the oxidized 
F222 were purif-ied in the same manner as peptide 5 
from the untreated enzyme, 

15 

Amino acid compositional analysis was obtained as 
follows. Samples (-InM each amino acid) were dried, 
hydrolyzed in vacuo with 100 ul 6N HC1 at 106°C for 24 
hours and then dried in a Speed Vac. The samples were 

2 0 

analyzed on a BecJcmann 630 0 AA analyzer employing 
ninhydr in detection . 

Amino terminal sequence data was obtained as 
previously described (Rodriguez, H., et al. (19B4) 
Anal. Biochem. 134 , 538-547) . 

The results are shown in Table VII and Figure 9, 



30 
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TABLE VII 

Amino and COOH terminii of CNBr fragments 
Terminus a nd Method 
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Fragment 

X 
9 
7 
8 

5 OX 

6 ox 



amino, method 

1, sequence 
51, sequence 
125 , sequence 

200, sequence 

1, sequence 

120, composition 



cooH , method 

50, composition 

119, composition 

199, composition 

275, composition 

119, composition 

199, composition 



Peptides Sox and 6ox refer to peptides 5 and 6 
isolated from CNBr digests of the oxidized protein 
where their respective levels are enhanced- 

From the data in Table VII and the comparison of SPG 
tracks for the oxidized and native protein digests in 
Figure 8, it is apparent that (1) Met50 is oxidized 
leading to the loss of peptides X and 9 and the 
appearance of 5; and (2) Metl24 is also oxidized 
leading to the loss of peptide 7 and the accumulation 
of peptide 6* Thus oxidation of B> amyloliguif aciens 
subtilisin with the peracid, diperdocecanoic acid 
leads to the specific oxidation of methionine at 
residues 50 and 124. 

EXAMPLE 2 
30 

Substitution at MetSO and Met 12 4 
in Subtili sin Met222Q 

The choice of amino acid for substitution at Met50 was 
based on the available sequence data for subtilisins 
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from B. lichenifomHft (Smith, E.C., et al. (1968) 
J, Biol* Chem. 143, 2184-2191), B.DY (Nedkov, P. , et 
al. (1983) Hoppe Savler's. z. Phvsiol, Chem. 364 
1537-1540), B. amvlosa cchariticus (Markland, F.S., et 
al. (1967) J. Biol. Che^ 242 5198-5211) and B . 
subtilis (Stahl, M* L. , et al. (1984) J, Bacterid. 
158, 411-418). In all cases, position 50 is a 
phenylalanine. See Figure 5. Therefore, Phe50 was 
chosen for construction. 

At position 124, all known subtilisins possess a 
methionine. See Figure 5. Molecular modelling of the 
x-ray derived protein structure was therefore required 
to determine the most probable candidates for 
substitution. From all 19 candidates, isoleucine and 
leucine were chosen as the best residues to employ. 
In order to test whether or not modification at one 
site but not both was sufficient to increase oxidative 
stability, all possible combinations were built on the 
Q222 backbone (F50/Q222, I124/Q222, F50/I124/Q222 ) . 

A. Construction of Mutations 
Between Codons 4 5 and 5 0 

All manipulations for cassette mutagenesis were 
carried out on pS4.5 using methods disclosed in EPO 
Publication No. 0130756 and Wells, J. A., et al, (1985) 
gene 34, 315-323. The p*50 in Fig* 10, line 4, 
mutations was produced using the mutagenesis primer 
shown in Fig. 10, line 6, and employed an approach 
designated as restriction-purification which is 
described below. Briefly, a M13 template containing 
the subtilisin gene, M13mpll-SUBT was used for 
heteroduplex synthesis (Adelman, et al (1983), DNA 2, 
183-193). Following transfection of JM101 (ATCC 
33876), the 1,5 kb EcoRI-BamHl fragment containing the 
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subtilisin gene was subcloned from M13mpll SUBT rf 
into a recipient. vector fragment of pBS4 2 the 
construction of which is described in EPO Publication 
No. 0130756. To enrich for the mutant sequence (pA50, 
line 4), the resulting plasmid pool was digested with 
5 Kpn l, and linear molecules were purified by 
polyacrylamide gel electrophoresis. Linear molecules 
were ligated back to a circular form, and transformed 
into coli MM294 cells (ATCC 31446). Isolated 

plasmids were screened by restriction analysis for the 
10 Ktm l site. Kpn l + plasmids were sequenced and 
confirmed the p£5D sequence. Asterisks in Figure 11 
indicate the bases that are mutated from the wid type 
sequence (line 4). pa50 (line 4) was cut with StuI 
and pcpRI and the 0.5 Kb fragment containing the 5' 
15 half of the subtilisin gene was purified (fragment 1) . 
pa 50 (line 4) was digested with ffpn l and Eco RI and the 
4.0 Kb fragment containing the 3 1 half of the 
subtilisin gene and vector sequences was purified 
(fragment 2). Fragments 1 and 2 (line 5), and duplex 
20 DNA cassettes coding for mutations desired (shaded 
sequence, line 6) were mixed in a molar ratio of 
1:1:10, respectively. For the particular construction 
of this example the DNA cassette contained the triplet 
TTT for codon 50 which encodes Fhe. This plasmid was 
25 designated pF50. The mutant subtilisin was designated 
F50. 



B. Construction of Mutation 
30 petween Codons 1 22 and 127 

The procedure of Example 2A was followed in 
substantial detail except that the mutagenesis primer 
of Figure 11, line 7 was used and restriction- 
purification for the EcoRV site in pM24 was used. In 
35 addition, the DNA cassette (shaded sequence, Figure 
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11, line 6) contained the triplet ATT for codon 124 
which encodes He and CTT for Lieu* Those plasmids 
which contained the substitution of He for Metl24vere 
designeated pli24. The mutant subtilisin was 
designated 1124. 

5 

C Construction of Various 

F5D/I124/0222 Multiple Mutants 

The triple mutant, F50/I124/Q222 # was constructed from 
10 a three-way ligation in which each fragment contained 
one of the three mutations. The single mutant Q222 
(pQ222) was prepared by cassette mutagenesis as 
described in EPO Publication No, 0130756. The F50 
mutation was contained on a 2.2kb Ava il to PvuII 
15 fragment from pF50; the 1124 mutation was contained on 
a 260 bp PvuII to £vall fragment from pI124; and the 
Q222 mutation was contained on 2.7 kb Ava il to Ava il 
fragment from pQ222. The three fragments were ligated 
together and transformed into coli MH294 cells. 

20 Restriction analysis of plasmids from isolated 
trans formants confirmed the construction. To analyze 
the final construction it was. . convenient that the 
Ava il site at position 798 in the wild-type subtilisin 
gene was eliminated by the 1124 construction. 

25 

The F50/Q2 22 and I124/Q22 2 mutants were constructed in 
a similar manner except that the appropriate fragment 
from pS4.5 was used for the final construction. 

30 

D. Oxidative Stability of Q222 Mutants 
The above mutants were analyzed for stability to 
peracid oxidation. As shown in Fig* 12, upon 
incubation with diperdodecanoic acid (protein 2mg/mL, 
35 oxidant 75ppm[0]), both the I124/Q222 and the 
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F50/I124/Q222 are completely stable whereas the 
F50/Q222 and the Q222 are inactivated. This indicates 
that conversion of Metl24 to 1124 in subtilisin Q222 
is sufficient to confer resistance to organic peracid 
oxidants - 



EXAMPLE 3 

Subtilisin Mutants Having Altered 
10 substrate Specif icity-Hydrophobic 
Substitutions at Residues 166 

Subtilisin contains an extended binding cleft which is 
hydrophobic in character, A conserved glycine at 
residue 166 was replaced with twelve non-ionic amino 
15 acids which can project their side-chains into the S-l 
subsite. These mutants were constructed to determine 
the effect of changes in size and hydrophobicity on 
the binding of various substrates* 

20 

A* Kinetics for Hydrolysis of Substrates 
Having Altered P-l Amino Acids by 
Subtilisin from B, Amvloliquef aciens 

Wild-type subtilisin was purified from J3. subtilis 
25 culture supematants expressing the B. amylolique - 
faciens subtilisin gene (Wells, J. A., et al. (1983) 
Nucleic Acids Res. 11, 7911-7925) as previously 
described (Estell, D,A. f gt fil, (19851 J, Biol- Chem. 
260 . 6518-6521) . Details of the synthesis of 
J0 tetrapeptide substrates having the form 
succinyl-L-AlaL-AlaL-ProL-[X]-p-nitroanilide (where X 
is the PI amino acid) are described by DelHar, E.G., 
et al- (1979) Anal. Biochem. £9, 316-320* Kinetic 
parameters, Knt(M) and kcat(s" 1 ) were measured using a 
5 modified progress curve analysis {Estell, D.A., £t al. 
(1985) J. Biol. Chem. 260 , 6518-6521). Briefly, plots 
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of rate versus product concentration were fit to the 
differential form of the rate equation using a 
non-linear regression algorithm. Errors in kcat and 
Km for all values reported are less than five percent. 
The various substrates in Table VIII are ranged in 
order of decreasing hydrophobicity . Nozaki, V. 

(1971*, J. Biol. Chem. 246, 2211-2217; Tanford C. 

(1978) Science 200, 1012). 



10 



TABLE VIII 



3.5 



20 



25 



30 



PI substrate 
Amino Acid 


kcat (S -1 ) 


l/Km{M _1 ) 


kcat/Km 
(s- 1 M-l) 


Phe 


50 


7,100 


360, 000 


Tyr 


28 


40,000 


1,100,000 


Leu 


24 


3,100 


75,000 


Met 


13 


9,400 


120,000 


His 


7.9 


1,600 


13,000 


Ala 


1.9 


5,500 


11,000 


Gly 


0.003 


8 ,300 


21 


Gin 


3.2 


2,200 


7,100 


Ser 


2.8 


1 ,500 


4,200 


Glu 


0.54 


32 


16 



The ratio of kcat/Km (also referred to as catalytic 
efficienty) is the apparent second order rate constant 
for the conversion of free enzyme plus substrate (E+S) 
to enzyme plus products (E+F) (Jencks, W.P. f Catalysis 
in Chemistry and Enzymology {McGraw-Hill, 1969) pp. 
321-436? Fersht, A. , Enzyme Structure and Mechanism 
{Freeman, San Francisco, 1977) pp* 226-287) • The log 
(kcat/Km) is proportional to transition state binding 
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energy, aG^. A plot of the log kcat/Km versus the 
hydrophobicity of the PI side-chain (Figure 14) shows 
a strong correlation {r « 0.98), with the exception of 
the glycine substrate which shows evidence for 
non-productive binding. These data show that relative 
differences between transition-state binding energies 
can be accounted for by differences in P-l side-chain 
hydrophobicity. When the trans it ion- state binding 
energies are calculated for these substrates and 
plotted versus their respective side-chain 
hydrophobicities, the line slope is 1.2 (not shown). 
A slope greater than unity # as is also the case for 
chymotrypsin (Fersht, A* , Enzyme Structure and 
Mechanism (Freeman, San Francisco, 1977) pp. 226-287; 
Harper, J.W. , et al . (1984) Biochemistry , 22,, 
2995-3002} , suggests that the PI binding cleft is more 
hydrophobic than ethanol or dioxane solvents that were 
used to empirically determine the hydrophobicity of 
amino acids (Nozaki, Y. , et al. J. Biol. Chem. (1971) 
246, 2211-2217; Tanford, C. (1978) Science 2 00 , 1012). 

For amide hydrolysis by subtil is in, kcat can be 
interpreted as the- acylation rate constant and Km as 
the dissociation constant, for the Michaelis complex 
(E*S), Ks. Gutfreund, H. # et al (1956) Biochem. J. 63, 
656 ♦ The fact that the log kcat, as well as log 1/Km, 
correlates with substrate hydrophobicity is consistent 
with proposals (Robertus, J.D., £t &1. (1972) 
Biochemistry 11, 2439-2449; Robertus, J.D., et al. 
(1972) Biochemistry 11 , 4293-4303) that during the 
acylation step the P-l side-chain moves deeper into 
the hydrophobic cleft as the substrate advances from 
the Michaelis complex (E»S) to the tetrahedral 
transition-state complex (E^s'*). However, these data 
can also be interpreted as the hydrophobicity of the 
PI side-chain effecting the orientation, and thus the 
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susceptibility of the scissile peptide bond to 
nucleophilic attack by the hydroxy 1 group of the 
catalytic Ser221. 

The dependence of kcat/Km on P-l side chain 
hydrophobicity suggested that the . kcat/Km for 
hydrophobic substrates Bay be increased by increasing 
the hydrophobicity of the 3-1 binding subsite. To 
test this hypothesis, hydrophobic amino acid 
substitutions of Glyi66 were produced. 

Since hydrophobicity of aliphatic side-chains is 
directly proportional to side-chain surface area 
(Rose, G.D., et al. (1985) Science 229 . 834-838; 
Reynolds, J.A.,~ et al. (1974) Froc. Natl. Acad, Sci, 
USA 71, 2825-2927) , increasing the hydrophobicity in 
the S-l subsite may also sterically hinder binding of 
larger substrates. Because of difficulties in 
predicting the relative importance of these two 
opposing effects, we elected to generate twelve 
non-charged mutations at position 166 to determine the 
resulting specificities against non-charged substrates 
of varied size and hydrophobicity. 



B. Cassette Mutagenesis of 
the PI Binding Cleft 

The preparation of mutant subtilisims containing the 
substitution of the hydrophobic amino acids Ala, Val 
and Phe into residue 166 has been described in EPO 
Publication No, 013 0756. The same method was used to 
produce the remaining hydrophobic mutants at residue 
166. In applying this method, two unique and silent 
restriction sites were introduced in the subtil isin 
genes to closely flank the target codon 166. As can 
be seen in Figure 13, the wild type sequence (line 1) 
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was altered by site-directed mutagenesis in M13 using 
the indicated 37mer mutagenesis primer, to introduce a 
13 bp delection (dashedline) and unique Sad and JCmal 
sites (underlined sequences) that closely flank codon 
166. The subtilisin gene fragment was subclones back 

5 into the E. coli - B. subtil is shuttle plasmid, pB342, 
giving the plasmid p*166 (Figure 13, line 2), p^l66 
was cut open with SacT and Xma l . and gapped linear 
molecules were purified (Figure 13, line 3). Pools of 
synthetic oligonucleotides containing the mutation of 

10 interest were annealed to give duplex DNA cassettes 
that were ligated into gapped pAl66 (underlined and 
overlined sequences in Figure 13, line 4). This 
construction restored the coding sequence except over 
position 166 (NNN; line 4). Mutant sequences were 

15 confirmed by dideoxy sequencing. Asterisks denote 
sequence changes from the wild type sequence* 
Plasmids containing each mutant B. amvlolig uefaciens 
subtilisin gene were expressed at roughly equivalent 
levels in a protease deficient strain of B. subtil is , 

20 BG2036 as previously described. EPO Publication No. 
0130756; Yang, H» , et al. (1984) J- Bacterid, 160, 
15-21; Estell, D.A.", et a! (1985) J. Biol, Chem. 260, 
6518-6521. 

25 

C. Narrowing Substrate Specificity 
bv Steric Hindrance 

To probe the change in substrate specificity caused by 
steric alterations in the S-l subsite, position 166 

30 mutants were kinetically analyzed versus PI substrates 
of increasing size (i.e., Ala, Met, Phe and Tyr) . 
Ratios of kcat/Km are presented in log form in 
Figure 15 to allow direct comparisons of transition- 
state binding energies between various enzyme- 

35 substrate pairs. 
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According to transition state theory, the free enery 
difference between the free enzyme plus substrate 
(E + S) and the transition state complex (E»S^) can be 
calculated from equation (1) , 

5 (1) *G;jJ - -RT In kcat/Km + RT In kT/h 

in which kcat is the turnover number, Km is the 
Michaelis constant, R is the gas constant, T is the 
temperature, k is Boltzmann's constant, and h is 
10 Planck* s constant. Specificity differences are 
ezpressed quantitatively as differences between 
transition state binding energies {i,e w aaG^) , and 
can be calculated from equation (2)* 

15 (2) Lh <^ = -RT In (kcat/Km) A / (kcat/Km) B 

A and B represent either two different substrates 
assayed againt the same enzyme, or two mutant enzymes 
assayed against the same substrate, 

20 

As can be seen from Figure ISA, as the size of the 
side-chain at position 166 increases the substrate 
preference shifts from large to small P-l side-chains* 
Enlarging the side-chain at position 166 causes 
kcat/Km to decrease in proportion to the size of the 
P.-l substrate side-chain (e.g., from Glyl66 
(wild-type) through W166, the kcat/Km for the Tyr 
substrate is decreased most followed in order by the 
Phe, Met and Ala P-l substrates) . 

30 

Specific steric changes in the position 166 
side-chain, such as he presence of a £ -hydroxy 1 group, 
or 7 -aliphatic branching, cause large decreases in 
kcat/Km for larger PI substrates. Introducing a 
p -hydroxy 1 group in going from A166 (Figure 15 A) to 
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S166 (Figure 15B) , causes an 8 fold and 4 fold 
reduction in kcat/Km for Phe and Tyr substrates , 
respectively, while the values for Ala and Het 
substrates are unchanged. Producing a ^-branched 
structure, in going from S166 to T166, results in a 

5 drop of 14 and 4 fold in kcat/Km for Phe and Tyr, 
respectively. These differences are slightly 
magnified for V166 which is slightly larger and 
isosteric with T166. Enlarging the ^-branched 
substituents from V166 to 1166 causes a lowering of 

10 kcat/Km between two and six fold toward Het, Phe and 
Tyr substrates- Inserting a 7 -branched structure, by 
replacing M166 (Figure ISA) with L166 (Figure 15B) , 
produces a 5 fold and 18 fold decrease in Jccat/Km for 
Phe and Tyr substrates, respectively. Aliphatic 

15 7 -branched appears to induce less steric hindrance 
toward the Phe P-l substrate than 0 -branching, as 
evidenced by the 100 fold decrease in kcat/Kin for the 
Phe substrate in going from L166 to 1166. 

20 Reductions in kcat/Km resulting from increases in side 
chain size in the S-l subsite, or specific structural 
features such as 0~ and 7 -branching, are quantita- 
tively illustrated in Figure 16. The kcat/Km values 
for the position 166 mutants determined for the Ala, 

25 Met, Phe, and Tyr P-l substrates (top panel through 
bottom panel, respectively), are plotted versus the 
position 166 side-chain volumes (Chothia, C. (1984) 
Ann, Rev. Biochem. 53, 537-572). Catalytic efficiency 
for the Ala substrate xeaches a maximum for 1166, and 

30 f or the Met substrate it reaches a. maximum between 
V166 and L166. The Phe substrate shows a broad 
kcat/Km peak but is optimal with A166* Here, the 
^-branched position 166 substitutions form a line that 
is parallel to, but roughly 50 fold lower in kcat/Km 

35 than side-chains of similar size Ei*e-t C166 versus 
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T166, L166 versus 1166], The Tyr substrate is most 
efficiently utilized by wild type enzyme (Glyl66) , and 
there is a steady decrease as one proceeds to large 
position 166 side-chains. The ^-branched and 
7-branched substitutions form a parallel line below 
5 the other non-charged substitutions of similar 
molecular volume* 



The optimal substitution at position 166 decreases in 
volume with increasing volume of the PI substrate 

10 C A - e w 1166/Ala substrate, L166/Het substrate, 
A166/Fhe substrate, Glyl66/Tyr substrate]. The 
combined volumes for these optimal pairs may 
approximate the volume for productive binding in the 
3-1 subsite. - For the optimal pairs, Glyl66/Tyr 

15 substrate, A166/Phe substrate, L166/Met substrate, 
V166/Met substrate, and 1166/Ala substrate, the 
combined volumes are 266,295,313,339 and 261 A 3 , 
respectively. Subtracting the volume of the peptide 
backbone from each pair (i.e., two times the volume of 
2o glycine) , an average side-chain volume of 160±32A 3 for 
productive binding can be calculated. 

The effect of volume, in excess to _the productive 
binding volume, on the drop in transition-state 

25 binding energy can be estimated from the Tyr substrate 
curve (bottom panel, Figure 16), because these data, 
and modeling studies (Figure 2} , suggest that any 
substitution beyond glycine causes steric repulsion. 
A best-fit line drawn to all the data (r - 0.87) gives 

30 a slope indicating a loss of roughly 3 kcal/mol in 
transition state binding energy per 100A 3 of excess 
volume. (100A 3 is approximately the size of a leucyl 
side-chain.) 

35 
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D. Enhanced Catalytic Efficiency 

Correlates with Increasing Hydrophobicity 
nf the Position 166 Sub stitution 

Substantial increases in kcat/Km occur with 
enlargement of the position 166 side-chain, except for 
5 the Tyr P-l substrate (Figure 16). For example, 
kcat/Km increases in progressing from Glyl66 to 1166 
for the Ala substrate (net of ten-fold) , from Glyl66 
to L166 for the Met substrate (net of ten-fold) and 
from Glyl66 to A166 for the Phe substrate (net of 
10 two-fold) . The increases in kcat/Km cannot be 
entirely explained by the attractive terms in the van 
der Waals potential energy function because of their 
strong distance dependence (1/r 6 ) and because of the 
weak nature of these attractive forces (Jencks, W.P., 
15 Catalysis in Chemistry and En zvmoloav (McGraw-Hill , 
1969) pp. 321-436; Fersht, A., Enzvme S tructure and 
Mechanism (Freeman, San Francisco, 1977) pp. 226-287; 
Levitt, M. (1976) J. Mol. Biol. iM, 59-107) . For 
example, Levitt (Levitt, M. (1976) J. Mol. Biol. 104, 
20 59-107) has calculated that the van der Waals 
attraction between two methionyl residues would 
produce a maximal interaction energy of roughly -0.2 
kcal/mol. This energy would translate to only 1.4 
fold increase in kcat/Km. 

The increases of catalytic efficiency caused by 
side-chain substitutions at position 166 are better 
accounted for by increases in the hydrophobicity of 
the S-l subsite. The increase kcat/Km observed for 
the Ala and Met substrates with increasing position 
166 side-chain size would be expected, because 
hydrophobicity is roughly proportional to side-chain 
surface area (Rose, G.D., e£ al. (19B5) Science 229, 
834-838; Reynolds, J. A. , fit Al. (1974) Proc. Natl. 
fread. Sci. USA 71, 2825-2927) . 



25 



30 



35 



0251446 

-77- 

Another example that can be interpreted as a 
hydrophobic effect is seen when comparing kcat/Km for 
isosteric substitutions that differ in hydrophobicity 
such as S166 and C166 (Figure 16) . Cysteine is 

5 considerably more hydrophobic than serine (-1.0 versus 
+ 0.3 kcal/mol) (Nozaki, Y., et al. (1971) J * Biol. 
Chenu 246, 2211-2217; Tanford, C. (1978} Science 200, 
1012). The difference in hydrophobicity- correlates 
with the observation that C166 becomes more efficient 

1Q relative to Serl66 as the hydroph obi city of the 
substrates increases (i.e., Ala < Met < Tye < Phe) . 
Steric hindrance cannot explain these differences 
because serine is considerably smaller than cysteine 
(99 versus 118A 3 ). Paul, I.C., Chemistry of the -sh 
15 Group (ed. s. ~Patai, Wiley Interscience, New York, 
1974) pp. 111-149. 



20 



25 



30 



35 



E. Production of an Elastase-Like 
Specificity in Subtilisin 

The 1166 mutation illustrates particularly well that 
large changes in specificity can be produced by 
altering the * structure and hydrophobicity of the S-l 
subsite by a single mutation (Figure 17) . Progressing 
through the small hydrophobic substrates, a maximal 
specificity improvement over wild type occurs for the 
Val substrate (16 fold in kcat/Km) . As the substrate 
side chain size increases, these enhancements shrink 
to near unity (i.e., Leu and His substrates). The 
1166 enzyme becomes poorer against larger aromatic 
substrates of increasing size (e.g., 1166 is over 
1,000 fold worse against the Tyr substrate than is 
Glyl66) . We interpret the increase in catalytic 
efficiency toward the small hydrophobic substrates for 
1166 compared to Glyl66 to the greater hydrophobicity 
of isoluecine (i*e. , -1.8 kcal/mol versus 0). Nozaki, 
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Y. , et al. (1971) J. Biol. Chem. 246, 2211-2217; 
Tanford, C. (1978) Science 200 . 1012. The decrease in 
catalytic efficiency toward the very large substrates 
for 1166 versus Glyl66 is attributed to steric 
repulsion, 

5 

The specificity differences between Glyl66 and 1166 
are similar to the specificity differences between 
chymotrypsin and the evolutionary xelative, elastase 
(Harper, J.W., et al (1984) Biochemistry £3, 

10 2995-3002). In elastase, the bulky amino acids, Thr 
and Val, block access to the P-l binding site for 
large 'hydrophobic substrates that are preferred by 
chymotrypsin. In addition, the catalytic efficiencies 
toward small hydrophobic substrates are greater for 

15 elastase than for chymotrypsin as we obeseve for 1166 
versus Glyl66 in subtilisin. 

EXAMPLE 4 

20 Substitution of Ionic Amino 
Acids for G1V166 : 

The construction- of subtilisin mutants containing the 
substitution of the ionic amino acids Asp, Asn, Gin, 
!Lys and Ang are disclosed in EP0 Publication No. 

25 0130756. The present example describes the 
construction of the mutant subtilisin containing Glu 
at position 166 (E166) and presents substrate 
specificity data on these mutants. Further, data on 
position 166 and 156 single and double mutants is 

30 presented infra. 

p*166, described in Example 3, was digested with SacI 
and Xmal. The double strand DNA cassette (underlined 
and overlined) of line 4 in Figure 13 contained the 

35 
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triplet GAA for the codon 166 to encode the 
replacement of Glu for Glyl66. This mutant plasmid 
designated PQ166 was propagated in BG2036 as 
described. This mutant subtil is in, together with the 
other mutants containing ionic substituent amino acids 
at residue 166, were isolated as described and further 
analyzed for variations in substrate specificity. 

Each of these mutants was analyzed with the 
tetrapeptide substrates, succinyl-L-AlaL-AlaProL-X- 
-p-nitroanilide, where X was Phe, Ala and Glu. 

The results of this analysis are shown in Table IX. 



15 



TABLE IX 



Position 166 



P-l Substrate 
(kcat/Xm x 10~ 4 ) 

Al3 Glu 



20 



25 



Gly (wild type) 

Asp (D) 

Glu (E) 

Asn (N) 

Gin (Q) 

Lys (K) 

Arg (R) 



36.0 
0.5 
3.5 

ie.o 

57.0 
52.0 
42.0 



1.4 
0.4 
0.4 
1.2 
2.6 
2.8 
5.0 



0. 002 
<0. 001 
<0.001 
0.004 
0.002 
1.2 
0.08 



These results indicate that charged amino acid 
substitutions at Glyl66 have improved catalytic 
efficiencies (kcat/Km) for oppositely charged P-l 
substrates (as much as 500 fold) and poorer catalytic 
efficiency for like charged P-l substrates. 



35 
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EXAKPLE 5 

Substitution of Glycine at Position 169 
The substitution of Glyl69 in £. amvlbliauefdciens 
subtilisin with Ala and Ser is described in EPO 
Publication No* 013 0756. The same method was used to 
make the remaining 17 mutants containing all other 
substituent amino acids for position 169. 

The construction protocol is summari2ed in Figure 18 • 
The overscored and underscored double stranded DNA 
cassettes used contained the following triplet 
encoding the substitution of the indicated amino acid 
at residue 169. 



GCT 


A 


ATG 


M 


TGT 


C 


AAC 


N 


GAT 


D 


CCT 


P 


GAA 


E 


CAA 


Q 


TTC 


F 


AGA 


R 


GGC 


G 


AGC 


S 


CAC 


H 


ACA 


T 


ATC 


I 


GTT 


V 


AAA 


' K 


TGG 


W 


CTT 


L 


TAC 


Y 



Each of the plasmids containing a substituted Glyl69 
was designated pX169, where X represents the 
substituent amino acid. The mutant subtilisins were 
simialrly designated. 

Two of the above mutant subtilisins, A169 and S169, 
were analyzed for substrate specificity against 
synthetic substrates containing Fhe, Leu, Ala and Arg 
in the P-l position* The following results are shown 
in Table X. 
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TABLE X 

Effect of Serine and Alanine Mutations 
at Position 169 on P-l Substrate Specificity 



P-l Substrate fkcat/Km x 10 ) 



5 



Position 169 


Phe- 


Leu 


Ala 


Arg 


Gly (wild type) 


40 


10 


l 


0.4 


A169 


120 


20 


i 


0.9 


S169 


50 


10 


l 


0.6 



10 



These results indicate that substitutions of Ala and 
Ser at Glyl69 have remarkably similar catalytic 
efficiencies against a range of P-l substrates 
15 compared to their position 166 counterparts. This is 
probably because position 169 is at the bottom of the 
P-l specificity subsite. 

EXAMPLE 6 

20 

Substitution at Position 104 

Tyrl04 has been substituted with Ala, His, Leu, Met 
and Ser, The method used was a modification of the 

25 site directed mutagenesis method. According to the 
protocol of Figure 19, a primer (shaded in line 4} 
introduced a unique Hindu I site and a frame shift 
mutation at codon 104 . Restriction-purification for 
the unique Hindlll site facilitated the isolation of 

30 the mutant sequence (line 4). Restriction-selection 
, against this Hind lll site using pimers in line 5 was 
used to obtain position 104 mutants* 
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The following triplets were used in the primers of 

Fiaure 19, line 5 for the 104 codon which substituted 
the following amino acids. 



5 



10 



GCT 


A 


TTC 


F 


ATG 


M 


CCT 


P 


CTT 


L 


ACA 


T 


AGC 


S 


TGG 


W 


CAC 


H 


TAC 


y 


CAA 


Q 


GTT 


V 


GAA 


E 


AGA 


R 


GGC 


G 


AAC 


N 


ATC 


I 


GAT 


D 


AAA 


K 


TGT 


C 



The substrates in Table XI were used to analyze the 
substrate specificity of these mutants. The results 
obtained fo Hi 04 subtilisin are shown in Table XI, 
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TABLE XI 
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Substrate 

sAAFFpNA 
sAAPApHA 
SFAPFpNA 
sFAEApNA 



kcat 



Km 



wr 


H104 


wr 


H104 


50.0 


22.0 


1.4xl0~ 4 


7.1xl0~ 4 


3.2 


2.0 


2.3xl0~ 4 


1.9xl0~ 3 


26.0 


38.0 


1.8xl0~ 4 


4.1xl0"" 4 


0.32 


2.4 


7.3xl0" 5 


1.5xl0"" 4 



Kcat/fin 



1.4x10 



H104 



3.6x10 s 3.1x10 



1x10" 



1.5X10 3 9.1x10 
4.4xl0 3 1.6x10- 



30 



35 



From these data it is clear that the substitution of 
His for Tyr at position 104 produces an enzyme which 
is more efficient {higher kcat/Km) when Phe is at the 
p-4 substrate position than when Ala is at the P-4 
substrate position. 
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Substitution of Alal52 

Alal52 has been substituted by Gly and Ser to 
determine the effect of such substitutions on 
substrate specificity. 



10 



15 



The wild type DNA sequence was mutated by the 
V152/F153 primer (Figure 20, line 4) using the above 
restriction-purification approach for the new Kon l 
site. Other mutant primers (shaded sequences Figure 
20; S152, line 5 and G152, line 6) mutated the new 
Kpnl site away and such mutants were isolated using 
the restriction-selection procedure as described above 
for loss of the Kpnl site. 



20 



The results of these substitutions for the above 
synthetic substrates containing the F-l amino acids 
Phe, Leu and Ala are shown in Table XII. 



TABLE XII 



25 



Position 15 2 



P-l Substrate 



(kcat/KmxlO -4 ) 



-Ebj 



Leu 



Ala 



30 



Gly (G) 

Ala (wild type) 
Ser (S) 



0.2 
40.0 
1.0 



0.4 
10.0 
0.5 



<0.04 
1.0 
0.2 



35 



These results indicate that, in contrast to positions 
166 and 169, replacement of Alal52 with Ser or Gly 
causes a dramatic reduction in catalytic efficiencies 
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across all substrates tested* This suggests Alal52, 
at the top of the S-l eubsite, may be the optimal 
amino acid because Ser and Gly are homologous Ala 
substitutes . 

5 

EXAMPLE 8 
Substitution at Position 156 

Mutants containing the substitution of Ser and Gin for 
10 Glul55 have been constructed according to the overall 
method depicted in Figure 21. This method was 
designed to facilitate the construciton of multiple 
mutants at position 156 and 166 as will be described 
hereinafter. However, by regenerating the wild type 
15 Glyl66, single mutations at Glul56 were obtained. 

The plasmid pM66 is already depicted in line 2 of 
Figure 13. The synthetic oligonucleotides at the top 
right of Figure 21 represent the same DNA cassettes 
20 depicted in line 4 of Figure 13. The plasmid pl66 in 
Figure 21 thus represents the mutant plasmids of 
Examples 3 and 4. In this particular example, pl66 
contains the wild type Glyl66. 

2 5 construction of position 156 single mutants were 
prepared by ligation of the three fragments (1*3) 
indicated at the bottom of Figure 21. Fragment 3, 
containing the carboxy-terminal portion of the 
subtilisin gene including the wild type position 166 

30 codon, was isolated as a 610 bp Sacl^BamHI . fragment * 
Fragment 1 contained the vector sequences, as well as 
the amino-terminal sequences of the subtilisin gene 
through codon 151. To produce fragment 1, a unique 
Kpn l site at codon 152 was introduced into the wild 

35 type subtilisin sequence from pS4,5. Site-directed 
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mutagenesis in M13 employed a primer having the 
sequence 5 * -TA-GTC-GTT— GCG-GTA-CCC-GGT-AAC-GAA-3 ' to 
produce the mutation. Enrichment for the mutant 
sequence was accomplished by restriction with KpnI. 
purification and self ligation. The mutant sequence 
containing the Rcni site was confirmed by direct 
plasmid sequencing to give pVi52. pV152 (-1 P g) was 
digested with jCpnl and treated with 2 units of DNA 
polymerase I large fragment {Klenow fragment from 
Boeringer-Mannheim) plus 50 deoxynucleotide 
triphosphates at 37 *C for 30 min. This created a 
blunt end that terminated with codon 151. The DNA was 
extracted with 1:1 volumes phenol and CHClg and DNA in 
the aqueous phase was precipitated by addition of 0.1 
volumes 5M ammonium acetate and two volumes ethanol. 
After centrifugation and washing the DNA pellet with 
15 70% ethanol, the DNA was lyophilized. DNA was 
digested with BamHl and the 4.6Jcb piece (fragment 1) 
was purified by acrylamide gel electrophoresis 
followed by electroelution. Fragment 2 was a duplex 
synthetic DNA cassette which when ligated with 
fragments 1 and 3 properly restored the coding 
sequence except at codon 156. The top strand was 
synthesized to contain a glutamine codon, and the 
complementary bottom strand coded for serine at 156. 
Ligation of heterophosphorylated cassettes leads to a 
large and favorable bias for the phosphorylated over 
the non-phosphorylated oligonucleotide sequence in the 
final segrated plasmid product. Therefore , to obtain 
Q156 the top strand was phosphorylated, and annealed 
to the non-phosphorylated bottom strand prior to 
30 ligation. Similarly, to obtain S156 the bottom strand 
was phosphorylated and annealed to the 
non-phosphorylated top strand. Mutant sequences were 
isolated after ligation and transformation,, and were 
confirmed by restriction analysis and DNA sequencing 

35 
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as before. To express variant subtilisins , plasmids 
were transformed into a subtilisin-neutral protease 
deletion mutant of g. subtilis . BG2 036, as previously 
described. Cultures vere fermented in shake flasks 
for 24 h at 37 *C in LB media containing 12.5 mg/mL 
chloramphenicol and subtil is in was purified from 
culture supernatants as described. Purity of 
subtilisin was greater than 95% as judged by SDS PAGE. 

These mutant plasmids designated pS156 and pQ156 and 
mutant subtilisins designated S156 and Q156 were 
analyzed with the above synthetic substrates where P-l 
comprised the amino acids Glu, Gin, Met and Lys. The. 
results of this analyses are presented in Example 9. 



15 

EXAMPLE 9 



Multiple Mutants With Altered 
Substrate Specificity - Substitution 
at Positi ons 7 5fi and 166 

Single substitutions of position 166 are described in 
Examples 3 and 4. Example 8 describes single 
substitutions at position 156 as well as the protocol 
of Figure 21 whereby various double mutants comprising 
the substitution of various amino acids at positions 
156 and 166 can be made. This example describes the 
construction and substrate specificity of subtilisin 
containing substitutions at position 156 and 166 and 
summarizes some of the data for single and double 
mutants at positions 156 and 166 with various 
substrates . 

K166 is a common replacement amino acid in the 156/166 
mutants described herein. The replacement of Lys for 



! 
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Glyl66 was achieved by using the synthetic DNA 
cassette at the top right of Figure 21 which contained 
the triplet AAA for NNN. This produced fragment 2 
with lys substituting for Glyl66. 

The 156 substituents were Gin and Ser* The Gin and 
Ser substitutions at Glyl56 are contained within 
fragment 3 (bottom right Figure 21} . 

The multiple mutants were produced by combining 
fragments 1, 2 and 3 as described in Example 8. The 
mutants Q156/K166 and S156/K166 were selectively 
generated by differential phosphorylation as 
described. Alternatively, the double 156/166 mutants, 
c.f. Q156/K166 and S156/K166, were prepared by 
ligation of the 4-6kb Sac I- BamH I fragment from the 
relevant pl56 plasmid containing the 0.6kb SacI-BamHI 
fragment from the relevant pi 6 6 plasmid. 

These mutants, the single mutant K166, and the SI 5 6 
and Q156 mutants of Example 8 were analyzed for 
substitute specificity against synthetic polypeptides 
containing Phe or Glu as the P-l substrate residue. 
The results are" presented in Table XIII, 
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As can be seen in Table XIV, either of these single 
mutations improve enzyme performance upon substrates 
with glutamate at the P-l enzyme binding site. When 
these single mutations were combined, the resulting 
multiple enzyme mutants are better than either parent* 
These single or multiple mutations also alter the 
5 relative pH activity profiles of the enzymes as shown 
in Figure 23. 

To isolate the contribution of electrostatics to 
substrate specificity from other chemical binding 

10 forces, these various single and double mutants were 
analyzed for their ability to bind and cleave 
synthetic substrates containing Glu, Gin, Ket and Lys 
as the p-l substrate amino acid. This permitted 
comparisons between side-chains that were more 

15 sterically similar but differed in charge (e.g., Glu 
versus Gin, Lys versus Met) . Similarly, mutant 
enzymes were assayed against homologous P-l substrates 
that were most sterically similar but differed in 
charge (Table XIV) . 
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Footnotes to Table XIV : 0251446 

9 B. subtilis, BG 2036, expressing indicated 
variant subtilisin were fermented and enzymes purified 
as previously described (Estell, gt al* (1985) 
Biol. Chem. 260 , 6518*6521) . Wild type subtilisin is 
indicated (wt) containing Glul56 and Glyl66, 

^ Net charge in the P-l binding site is defined as 
5 the suin of charges from positions 156 and 166 at pH 
8.6. 

v s Values for kcat(s *) and Km(M) were measured in 
0.1M Tris pH 8,6 at 25 6 C as previously described 
against ■ P-l substrates having the form 
succinyl-L-AlaL-AlaL-ProL- [X] -p-nitroanilide , where X 
is the indicated P^l amino acid* Values for log 1/Rm 
10 are shown inside parentheses. All errors in 
determination of kcat/Km and 1/Km are below 5%. 

^ Because values for Glul56/Aspl66 (D166) are too 
small to determine accurately, the maximum difference 
taken for GluF-1 substrate is limited to a charge 
range of +1 to -1 charge change. 



15 



20 



25 



30 



n.d. *= not determined 

The kcat/Km ratios shown are the second order rate 
constants for the conversion of substrate to product, 
and represent the catalytic efficiency of the enzyme. 
These ratios are presented in logarithmic form to 
scale the data, and because log kcat/Km is 
proportional to' the lowering of transition-state 
activation energy (AG T ) . Mutations at position 156 
and 16 6 produce changes in catalytic efficiency toward 
Glu, Gin, Met and Lys P*l substrates of 3100, 60, 2 00 
and 20 fold, respectively. Making the P-l 
binding-site more positively charged [e.g., compare 
Glnl56/Lysl66 (Q156/K166) versus Glul56/Metl66 
(GlulS6/Ml66) ] dramatically increased kcat/Km toward 
the Glu P-l substrate (up to 3100 fold) , and decreased 
the catalytic efficiency toward the Lys P-l substrate 
(up to 10 fold) * In addition, the results show that 
the catalytic efficiency of wild type enzyme can be 
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greatly improved toward any of the four P-l substrates 
by mutagenesis of the F-l binding site. 



The changes in kcat/Km are caused predominantly by 
changes in l/Km. Because l/Km is approximately equal 
to 1/Ks, the enzyme-substrate association constant, 

5 the mutations primarily cause a change in substrate 
binding. These mutations produce smaller effects on 
kcat that run parallel to the effects on 1/Knw The 
changes in kcat suggest either an alteration in 
binding in the P-l binding site in going from the 

10 Michaelis-complex E*S) to the transition-state complex 
(E-S/) as previously proposed (Robertus, J.D. , et al . 
(1972) Biochemistry , 11 , 2439-2449? Robertus, J.D., et 
al. (1972) Biochemistry 11 , 4293-4303) , or change in 
the position of the scissile peptide bond over the 

15 catalytic serine in the E*S complex. 

Changes in substrate preference that arise from 
changes in the net charge in the P-l binding site show 
trends that are best accounted for by electrostatic 

20 effects (Figure 2B) . As the P-l binding cleft becomes 
more positively charged, the average catalytic 
efficiency increases much more for the Glu P-l 
substrate than for its neutral and isosteric P-l 
homolog, Gin (Figure 2 8 A) . Furthermore, at the 

25 positive extreme both substrates have nearly identical 
catalytic efficiencies. 

In contrast , as the P-l site becomes more positively 
charged the catalytic efficiency toward the Lys P-l 
J0 substrate decreases, and diverges sharply from its 
neutral and isosteric homolog, Met (Figure 28B) . The 
similar and parallel upward trend seen with increasing 
positive charge for the Met and Glu P-l substrates 
probably results from the fact that all the substrates 
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are succinylated on their amino-terminal end, and thus 
carry a formal negative charge. 

The trends observed in log kcat/Km are dominated by 
changes in the Km term (Figures 28C and 28D) . As the 
pocket becomes more positively charged, the log 1/Km 

5 values converge for Glu and Gin P-l substrates (Figure 
2 8C), and diverge for Lys and Het P-l substrates 
(Figure 28D) . Although less pronounced effects are 
seen in log kcat, the effects of p-l charge on log 
kcat parallel those seen in log 1/Ka and become larger 

10 as the P-l pocket becomes more positively charged. 
This may result from the fact that the transition- 
state is a tetrahedral anion, and a net positive 
charge in the enzyme may serve to provide some added 
stabilization to the transition-state. 

15 

The effect of the change in P-l binding-site charge on 
substrate preference can be estimated from the 
differences in slopes between the charged and neutral 
isosteric P-l substrates (Figure 28B) . The average 

2Q change in substrate preference (a log kcat/Km) between 
charged and neutral isosteric substrates increases 
roughly lo-fold as the complementary charge or the 
enzyme increases (Table XV) . When comparing Glu 
versus Lys, this difference is 100-fold and the change 

25 in substrate preference appears predominantly in the 
Km term. 
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TABLE xv 0251446 

Differential Effect on Binding Site 
Charge on log kcat/Km or (log 1/Km) . , 
for P-l Substrates that Differ in Charge* 



Change in P-l Binding A * log kcat/Km Ulog i/ K m) 
Site Charge _ GluGln MetLys GluLys 

-2 to -1 n.d. 1.2 (1-2) n.d. 

•1 to 0 0.7 (0.6) 1.3 (0.8) 2.1 (1.4) 

0 to +1 1.5 (1.3) 0.5 (0.3) 2.0 (1.5) 



Avg. change in 
log kcat/K or 
(log l/KmJ^per 

unit charge change 1.1 (1.0) 1.0 (0,8). 2.1 (1,5) 



15 laj The aiff erence i n the slopes of curves were taken 
between the P-l substrates over the charge interval 
given for log (kcat/Km) (Figure 28A, B) and (log 1/Km) 
(Figure 28C, p) , Values represent the differential 
effect a charge change has in distinguishing the 
substrates that are compared. 

20 tb) Charge in P-l binding site is defined as the sum 
of charges from positions 156 and 166. 
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The free energy of electrostatic interactions in the 
structure and energetics of salt-bridge formation 
depends on the distance between the charges and the 
microscopic dielectric of the media. To dissect these 
structural and microenvironmental effects, the 

5 energies involved in specific salt-bridges were 
evaluated. In addition to the possible salt-bridges 
shown (Figures 29A and 29B) , reasonable salt-bridges 
can be built between a Lys P-l substrate and Asp at 
position 166, and between a Glu P-l substrate and a 

10 Lys at position 166 (not shown) • Although only one of 
these structures is confirmed by X-ray crystalography 

(Poulos, T.L., et al. (1976) J. Mol , Biol. 257 

1097-1103), all models have favorable torsion angles 
(Sielecki, A.R. ^ et al. (1979) J. Mol. Biol. 134 . 
15 781-804), and do not introduce unfavorable van der 
Waals contacts. 

The change in charged P-l substrate preference brought 
about by formation of the model salt-bridges above are 
20 shown in Table XVI, 
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Footnotes -bo Table XVI : 

Molecular modeling shows it is possible to form a 
salt bridge between the indicated charged P-l 
substrate 'and a complementary charge in the P-l 
binding site of the enzyme at the indicated position 
changed. 

(h) 

Enzymes compared have sterically similar amino 
acid substitutions that differ in charge at the 
indicated position. 

The P-l substrates compared are structurally 
similar but differ in charge. The charged p-l 
substrate is complementary to the charge change at the 
position indicated between enzymes 1 and 2. 

Date from Table XIV was used to compute the 
difference in log (kcat/Km) between the charged and 
the non-charged P-l substrate (i.e., the substrate 
preference) . The substrate preference is shown 
separately for enzyme 1 and 2. 

f e) 

The difference in substrate preference between 
enzyme 1 (more highly charged) and enzyme 2 (more 
neutral) represents the rate change accompanying the 
electrostatic interaction. 



The difference between catalytic efficiencies (i.e., 
A log kcat/Km) for the charged and neutral P-l 
substrates (e.g., Lys minus Met or Glu minus Gin) give 
the substrate preference for each enzyme. The change 
in substrate preference' (AAlog kcat/Km) between the 
charged and more neutral enzyme homologs (e.g., 
Glul56/Glyl66 minus Glnl56 (Q156)/Glyl66) reflects the 
change in catalytic efficiency that may be attributed 
solely to electrostatic effects. 



These results show that the average change in 
substrate preference is considerably greater when 
electrostatic substitutions are produced at position 
166 (50-fold in kcat/Km) versus position 156 (12-fold 
in kcat/Km). From these log kcat/Km values, an 
average change in transition-state stabilization 
energy can be calculated of -1.5 and -2,4 kcal/mol for 
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substitutions at positions 156 and 166, respectively. 
This should represent the stabilization energy 
contributed from a favorable electrostatic interaction 
for the binding of free enzyme and substrate to form 
the transition-state complex. 

5 EXAMPLE 10 

Substitutions at Position 217 

Tyr217 has been substituted by all other 19 amino 
10 acids- Cassette mutagenesis as described in EPO 
publication No- 0130756 w.as used according to the 
protocol of Figure 22. The EcoRV restriction site was 
used for restriction-purification of pA217. 

15 Since this position is involved in substrate binding, 
mutations here effect kinetic parameters of the 
enzyme. An example is the substitution of Leu for Tyr 
at position 217. For the substrate sAAPFpNa , this 
mutant has a kcat of 277 5' and a Km of 4,7x10* with 

20 a kcat/Km ratio of 6xl0 5 . This represents a 5.5-fold 
increase in kcat with a 3-fold increase in Km over the 
wild type enzyme. 

In addition, replacement of Tyr 217 by Lys, Arg, Phe or 
25 Leu results in mutant enzymes which are more stable at 
pHs of about 9-11 than the WT enzyme. Conversely, 
replacement of Tyr217 by Asp, Glu, Gly or Pro results 
in enzymes which are less stable at pHs of about 9-11 
than the WT enzyme. * 
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Multiple Mutants Having 
Altered Thc^n stabjU^y 



^olig^clen subtilisin does not contain any 
5 cysteine residues. Thus, any at J 

™ ut' bil ; ty ^ r^uiL the 

substitution of m0 re than one amino acid in subtilisin 
with Cys. The folloving subtil .. sin rBsian&8 ^ 
multiply substituted with cysteine: 
10 Thr22/Ser87 

Ser24/ser87 

Mut. ge nes is of Ser 2 4 to Cys was carried out vith a s ■ 
phosphorylated - oligonucleotide primer havi ^ 
15 sequence y we 



5 » -pC-TAC-ACT-GGA^TGC-AAT-GTT-AAA-G-3 



uLerlL , " l0Catl ° n ° f ^^ches and the 

20 « * BegUenCe shows the Position of the altered 

^3A site., The B. MMte^ subtilise gene 

into „ 13 „ pll and single stranded ^ isoiat -d 

This template (Ml3mpli SU BT, was double primed with the 
25 5 phosphorylated M13 universal sequencing primer and 
the mutagenesis primer. Adelman, * al . (1983) m ^ 
183-193. The heteroduplex was transfected into 
competent MIDI cells and plaguss were probed for the 

nr\ SegUenCe (ZOller ' M * J " * ^ < 1982 > nuclei. 
30 &cid_Res i io, 6487-6500; Wallace, et al. fiosil 

Has leic Acig jtes, . o 3647-36 , } 

777 e-aj- 3647-3656) using a 

tetramethylammonium chloride hybridization protocol 
(Wood, et al. (1985) Proc. wti. z ead , gci, ne» ^ 
1585-1588) . The Ser87 to Cys mutation was prepared in 
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a similar fashion using a 5' phosphorylated primer 
having the sequence 

5 ■ -PGGC-GTT-GCG-CCA-IGC-0C&-TCA-CT-3 • . 

(The uouk indict «- p- ition ot tt « ; iB - a !* 

La the und«rU*a sequence shows the position of. 
new site.) The C24 and C87 stations were 

obtaS at a frequency of one and 
respectively. Kutant sequences were confirmee by 
dideoxy sequencing in M13 • 

Mut.genesis o£ Tyr21/Thr22 to A21/C22 was carried cut 
with a 5- phosphorylated oligonucleotide primer having 
the sequence 

5 • -pAC-TCT-CAA-GGC-GM'-$5T-GGC^TCA-AAT-GTT-2 > . 

(The asterisk show mismatches to the wild type 
sequence and the underlined sequence 
position of an altered *»» site.) Manipulations for 
heteroduplex synthesis were identical to those 
described for C24. Because direct cloning of the 
heteroduplex DNA fregment can yield increased 
frequencies of mutagenesis, the E^I-BamH! subtilise 
fragment was purified and ligated into pBS42. E- flail 
W ^34 cells were transformed with the ligation 
mixture and plasmid DHA was purified from isolated 
Lansformants. Plasmid DHA was screaned for the loss 
of the Sau3A site at =od=n 22 that was eliminated by 

M . two out of 16 plasmid 

the mutagenesis primsr. rwo _ 

prsparations had lost the wild type sau3A site. _ The 
mutant sequence was confirmed by did.oxy sequencing in 

M13. 
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Double mutants, C22/C67 and C24/C87, were constructed 
by ligating fragments sharing a common Clal site that 
separated the single parent cystine codons. 
Specifically, the 500 bp Eco RI- Cla l fragment 
containing the 5 1 portion of the subtilisin gene 
(including codons 22 and 24) was ligated with the 4.7 
kb Clal-EcoRI fragment that contained the 3 1 portion 
of the subtilisin gene (including codon 87) plus pBS42 
vector sequence. E. coli Mtt 294 was transformed with 
ligation mixtures and plasmid DNA was purified from 
individual transf ormants . Double-cysteine plasmid 
constructions were identified by restriction site 
markers originating from the parent cysteine mutants 
(i.e., C22 and C24, Sau3A minus; Cys87, Mst I plus). 
Plasmids from E . cpli were transformed into £. 
subtilis BG2 03 6. The thermal stability of these 
mutants as compared to wild type subtilisin are 
presented in Figure 3 0 and Tables XVII and XVIII. 
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10 



Effect of DTT on the Half-Time of 
Autolytic inactivation of Wild-Type 
and Disulfide Mutants of Subtilisin* 



Enzyme -DDT +DTT -DTT/+DTT 

""' min ' 

Wild-type 95 85 1.1 

C22/C87 44 25 1.8 

C24/C87 92 62 1.5 



1 Purified enzymes were either treated or not 
treated with 25mM DTT and dialyzed with or without 
lOmM DTT in 2mM CaCl 2 , 50mM Tris (pH 7.5) for 14 hr. 
at 4 C C. Enzyme concentrations were adjusted to 80ul 
aliquots were quenched on ice and assayed for residual 
activity. Half-times for autolytic inactivation were 
determined from semi-log plots of lo ?io (residual 
activity) versus time. These plots were linear for 
over 90% of the inactivation. 
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TABLE XVIII 
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Effect of Mutations in Subtilisin 
. on the Half -Time of Autolytic 
Inactivation at 58 * C* 



Enzyme N 

5 **in 

V?ild-type 120 
C22 22 

C24 120 

C87 104 
10 C22/C87 43 

C24/C87 115 



f *1 

Half-times for autolytic inactivation were 
15 determined for " wild-type and mutant subtilisins as 
described in the legend to Table XXI, Unpurified and 
non-reduced enzymes were used directly from |J P . 
subtilis culture supernatants. 
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The disulfides introduced into subtilisin did not 
improve the autolytic stability of the mutant enzymes 
when compared to the wild-type enzyme. However, the 
disulfide bonds did provide a margin of autolytic 
stability when compared to their corresponding reduced 
double-cysteine enzyme. Inspection of a highly 
refined x-ray structure of wild-type B. amvloliaue - 
faciens subtilisin reveals a hydrogen bond between 
Thr22 and Ser87. Because cysteine is a poor hydrogen 
donor or acceptor (Paul, I.e. (1974) in Chemistry of 
the -SH Group (Patai, S., ed.) pp. 111-149, Wiley 
Interscience, New York) weakening of 22/87 hydrogen 
bond may explain why the C22 and C87 single-cysteine 
mutant proteins are less autolytically stable than 
either C24 or wild-type (Table XVIII). The fact that 
C22 is less autolytically stable than C87 may be the 
result of the Tyr21A mutation (Table XVIII). Indeed, 
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construction and analysis of Tyr21/C2 2 shows the 
mutant protein has an autolytic stability closer to 
that of C87. In summary, the C22 and C87 of 
single-cysteine mutations destabilize the protein 
toward autolysis, and disulfide bond formation 
increases the stability to a level less than or equal 
to that of wild-type enzyme. 
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EXAMPLE 12 

Multiple Mutants Containing Substitutions 
at Position 222 an d Position 166 or 169 

Double mutants 166/222 and 169/222 were prepared by 
ligating together (1) the 2.3kb fecall fragment from 
pS4.5 which contains the 5 1 portion of the subtilisin 
gene and vector sequences, (2) the 200bp Avail 
fragment which contains the relevant 166 or 169 
mutations from the respective 166 or 169 plasmids, and 
(3) the 2.2kb Avail fragment which contains -the 
relevant 222 mutation 3 1 and of the subtilisin genes 
and vector sequence from the respective p222 plasmid. 

Although mutations at position 222 improve oxidation 
stability they also tend to increase the Km* An 
example is shown in Table XIX. In this case the A222 
mutation was combined with the K166 mutation to give 
an enzyme with kcat and Km intermediate between the 
two parent enzymes. 
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TABLE XIX 
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kcat 

WT 50 
A222 42 
5 K166 21 

K166/A222 29 

s libs tr ate sAAPFpNa 

10 

EXAMPLE 13 

Multiple Mutants Containing 
15 Substitutions at Positions 50, 156, 
166, 217 and Combinations Thereof 

The double mutant S15 6/A169 was prepared by ligation 
of two fragments, each containing one of the relevant 
mutations. The plasmid pS156 was cut with 2£mal and 

20 treated with SI nuclease to create a blunt end at 
codon 167* After removal of the nuclease by 
phenol/chloroform extraction and ethanol precipita- 
tion, the DNA was digested with BamH I and the 
approximately 4kb fragment containing the vector plus 

25 the 5 1 portion of the subtilisin gene through codon 
167 was purified. 

The pA169 plasmid was digested with Kpnl and treated 
with DNA polymerase Klenow fragment plus 50 /iM dNTPs 
30 to create a blunt end codon "at codon 168, The Klenow 
was removed by phenol/chloroform extraction and 
ethanol precipitation. The DNA was digested with 
BamHI and the 590bp fragment including codon 168 
through the carboxy terminus of the subtilisin gene 

35 



Km 

1.4X10 
9.9X10 
3.7X10 
2.0x10 
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was isolated. The two fragments were then ligated to 
give S156/A169. 



Triple and quadruple mutants were prepared by ligating 
together (1) the 220bp Pvu II/ Hae ll fragment containing 
the relevant 156, 166 and/or 169 mutations from the 
5 respective p!56, p!66 and/or pl69 double of single 
mutant plasmid, (2) the 550bp Haell/BamHI fragment 
containing the relevant 217 mutant from the respective 
p217 plasmid, and (3) the 3.9kb Pvu Il/ Bam HI fragment 
containing the F50 mutation and vector sequences. 

10 

The multiple mutant F50/S156/A169/L217, as well as B. 
amylolicruef aciens subtilisin, B. lichen formis 
subtilisin and the single mutant L217 were analyzed 
with the above synthetic polypeptides where the P~l 
15 amino acid in the substrate was Lys, His, Ala, Gin, 
Tyr, Phe, Met and Leu. These results are shown in 
Figures 26 and 27. 

These results show that the F50/S156/A169/L217 mutant 
20 has substrate specificity similar to that of the B. 
licheniformis enzyme and differs dramatically from the 
wild type enzyme. . 'Although only data for the L217 
mutant are shown, none of the single mutants (e.g., 
F50, S156 or A169) showed this effect, ■ Although B. 
25 licheniformis differs in 88 residue positions from B. 
amylolicruef aciens , the combination of only these four 
mutations accounts for most of the differences in 
substrate specificity between the two enzymes. 

30 EXAMPLE 14 

Subtilisin Mutants Having 
Altered Alkaline Stability 

A random mutagenesis technique was used to generate 
single and multiple mutations within the B. 

35 
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amvloliauef aciens subtilisin gene* Such mutants were 
screened for altered alkaline stability , Clones 
having increased (positive) alkaline stability and 
decreased (negative) alkaline stability were isolated 
and sequenced to identify the mutations within the 
subtilisin gene. Among the positive clones, the 
5 mutants V107 and R213 were identified ■ These single 
mutants were subsequently combined to produce the 
mutant V107/R213. 

One of the negative clones (V50) from the random 
1q mutagenesis experiments resulted in a marked decrease 
in alkaline stability. Another mutant (P50) was 
analyzed for alkaline stability to determine the 
effect of a different substitution at position 50, 
The F50 mutant was found to have a greater alkaline 
15 stability than wild type subtilisin and when combined 
with the double mutant V107/R213 resulted' in a mutant 
having an alkaline stability which reflected the 
aggregate of the alkaline stabilities for each of the 
individual mutants, 

20 

The single mutant R204 and double mutant C2 04/R213 
were identified by alkaline screening after random 
cassette mutagenesis over the region from position 197 
to 228. The C204/R213 mutant was thereafter modified 

25 to produce mutants containing the individual mutations 
C204 and R213 to determine the contribution of each of 
the individual mutations. Cassette mutagenesis using 
pooled oligonucleotides to substitute all amino acids 
at position 204, was utilized to determine which 

jq substitution at position 204 would maximize the 
increase in alkaline stability. The mutation from 
Lys213 to Arg was maintained constant for each of 
these substitutions at position 2 04. 
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A. Construction of pB0180, an 

E. coli-B. subtilis Shuttle Plasmid 

The 2.9 kb EcoRI-BamHI fragment from pBR3 27 
(Covarrubias , L* , et al. (1981) Gene 13, 25-35) was 
ligated to the 3.7kb Eco RI- Bam HI fragment of pBD6 4 
(Gryczan, T., et al, (1960) J. Bacterid. , 141, 
5 246-253) to give the recombinant plasmid pB0153. The 
unique EcoRI recognition sequence in pBD6 4 was 
eliminated by digestion with Eco RI followed by 
treatment with Klenow and deoxynucleotide 
triphosphates (Maniatis, T. , et al. (eds.) (1982) in 
10 Molecular Cloning/ A Laboratory Manual , Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y.). Blunt 
end ligation afrd transformation yielded pB0154, The 
unique ' Aval recognition sequence in pB0154 was 
eliminated in a similar manner to yield pB0I7a, 
15 pB0171 was digested with BamHI and PvuII and treated 
with Klenow and deoxynucleotide triphosphates to 
create blunt ends. The 6.4 kb fragment was purified, 
ligated and transformed into LE392 cells (Enquest, 
L.W., et al. (1977) J • Mol. Biol. Ill , 97-120), to 
20 yield pB0172 which retains the unique BamHI site* To 
facilitate subcloning of subtilisin mutants, a unique 
and silent Kpn l site starting at codon 166 was 
introduced into the subtilisin gene from pS4.5 (Wells, 
J. A., et al* (1983) Nucleic Acids Res. , 11 , 7911-7925) 
25 by site-directed mutagenesis. The Kpn l+ plasmid was 
digested with Eco RI and treated with Klenow and 
deoxynucleotide triphosphates to create a blunt end. 
The Klenow was inactivated by heating for 20 min at 
68 °C, and the DNA was digested with Bajn HI . The 1.5 kb 
30 blunt Eco RI "- BamH I fragment containing the entire 
subtilisin was ligated with the 5,8 kb Nrul -BamHI from 
pB0172 to yield pB0180. The ligation of the blunt 
Nrul end to the blunt Eco RI end recreated an EcoRI 

35 
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site. Proceeding clockwise around pB0180 from the 
EcoRI site at the 5' end of the subtilisin gene is the 
unique Bam HI site at the 3' end of the subtilisin 
gene, the chloramphenicol and neomycin resistance 
genes and UB110 gram positive replication origin 
derived from pBD64, the ampicillin resistance gene and 
gram negative replication origin derived from pBR327. 



B. Construction of Random Mutagenesis Library 

The 1.5 kb Eco RI - Bam HI fragment containing the B. 
amyloliguefaciens subtilisin gene (Wells et al., 1983) 
from pB0180 was cloned into Ml3mpll to give M13mpll 
SUBT essentially as previously described (Wells , J. A., 
et al. (1986) J, Biol. Chenu , 261 , 6564-6570) . 
Deoxyuridine containing template DNA was prepared 
according to Kunkel {Kunkel, T. A. (1985) Froc. Natl, 
Acad. Sci. USA , 82 488-492). Uridine containing 
template DNA (Kunkel, 1985) was purified by CsCl 
density gradients (Maniatis, T. et al. (eds.) (1982) 
in Molecular Cloning, A Laboratory Manual # Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N, Y, ) . A 
primer (Aval") having the sequence 

5 ' GAAAAAAG ACCCTAG CGTCGCTTA 

ending at codon -11, was used to alter the unique Ava l 
recognition sequence within the subtilisin gene, (The 
asterisk denotes the mismatches from the wild-type 
sequence and underlined is the altered Ava l site*) 

The 5 1 phosphorylated Aval primer (-320 pmol) and -40 
pmol (~120yg) of uridine containing M13mpll SUBT 
template in 1.8 8 ml of 53 mM NaCl, 7.4 mM MgC12 and 
7,4 mM Tris.HCl (p'H 7.5) were annealed by heating to 
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90°C fox 2 min. and cooling 15 min at 24°C (Fig. 31). 
Primer extension at 24 °C was initiated by addition of 
lOOyL containing 1 mM in all four deoxynucleotide 
triphosphates, and 20yl Klenow fragment {5 units/1). 
The extension reaction was stopped every 15 seconds 
over ten min by addition of 10pl 0.25 M EDTA <pH 8) to 

5 50yl aliquots of the reaction mixture. Samples were 
pooled, phenol chlorophorm extracted and DNA was 
precipitated twice by addition of 2.5 vol 100% 
ethanol, and washed twice with 70% ethanol. The 
pellet was dried, and redissolved in 0.4 ml 1 mM EDTA, 

10 10 mM Tris (pH 8) . 

Misincorporation of a-thiodeoxynucleotides onto the 3' 
ends of the pool .of randomly terminated template was 
carried out by incubating four 0.2 ml solutions each 
X5 containing one-fourth of the randomly terminated 
template mixture (~20pg) , 0.25 mM of a given 
a-thiodeoxynucleotide triphosphate, 100 units AMV 
polymerase, 50 mM KCL, 10 mM MgClj, 0.4 mM 
dithiothreitol, and 50 mM Tris (pH 8.3) (Champoux, 
20 J.J. (1984) Genetics , 2, 454-464). After incubation 
at 37 °C for 90 minutes, misincorporation reactions 
were sealed by incubation for five minutes at 37°C 
with 50 mM all four deoxynucleotide triphosphates (pH 
8) , and 50 units AMV polymerase. Reactions were 
25 stopped by addition of 25 mM EDTA (final) , and heated 
at 6 8 °C for ten min to inactivate AMV polymerase. 
After ethanol precipitation and resuspension , 
synthesis of closed circular heteroduplexes was 
carried out for two days at 14 °C under the same 
conditions used for the timed extension reactions 
above, except the reactions also contained 1000 units 
T4 DNA ligase, 0,5 mM ATP and 1 mM B-mercaptoethanol. 
Simultaneous restriction of each heteroduplex pool 
with Kpn l f Bam HI , and Eco RI confirmed that the 



30 



35 



-111- 



0251 446 



extension reactions were nearly quantitative, 
Heteroduplex DNA in each reaction mixture was 
methylated by incubation with 80yM 

S-adenosylmethionine and 150 units dam methylase for 1 
hour at 37 °C, Methylation reactions were stopped by 
heating at 68 °C for 15 i&in. 

One-half of each of the four methylated heteroduplex 

reactions were transformed into 2,5 ml competent E . 

coli JM101 (Messing, J. (1979) Recombinant DNA Tech. 

Bull . , 2 f 43-4 8} . The number of independent 

transformants from each of the four transformations 

ranged from 0,4-2,0 x 10 5 . After growing out phage 

pools, RF DNA from each of the four transformations 

was isolated and purified by centrifugation through 

CsCl density gradients. Approximately 2pg of RF DNA 

from each of the four pools was digested with Eco RI , 

BamHI and Ava l. The 1.5 kb EcoRI -BamHI fragment 

{i.e*, Ava l resistant) was purified on low gel 

temperature agarose and ligated into the 5.5 kb 

EcoRI-BamHI vector fragment of pB0180. The total 

number of independent transformants from each 

a-thiodeoxynucleotide misincorporation plasirtid library 

4 

ranged from 1.2-2.4 x 10 . The pool of plasmids from 
each of the four transformations was grown out in 2 00 
ml LB media containing 12.5pg/ml cmp and plasmid DNA 
was purified by centrifugation through CsCl density 
gradients- 



C, Expression and Screening 

of Subtilisin Point Mutants 

Plasmid DNA from each of the four misincorporation 
pools was transformed (Anagnostopoulos , C. , et al„ 
(1967), J, Bacterid. , 81, 741-746) into BG2036. For 
each transformation, 5yg of DNA produced approximately 
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2.5 x 10 5 independent BG203 6 transf ormants , and liquid 
culture aliguots from the four libraries were stored 
in 10% glycerol at 70 'C. Thawed aliguots of frozen 
cultures were plated on LB/5/ig/ml cmp/1.6% skim milk 
plates (Wells, J. A., et al. (19S3) yuclelc Acids Res. , 
11 , 7911-7925), and fresh colonies were arrayed onto 
96-well microtiter plates containing 150 1 per well LB 
media plus 12.5^g/ml cmp. After 1 h at room 
temperature, a replica was stamped (using a matched 96 
prong stamp) onto a 132 mm BA 85 nitrocellulose filter 
(Schleicher and Scheull) which was layered on a 14 0 mm 
diameter LB/cmp/skim milk plate. Cells were grown 
about 16 h at 30 *C until halos of proteolysis were 
roughly 5-7 mm in diameter and filters were 
transferred directly to a freshly prepared agar plate 
at 37 'C containing only 1.6% skim milk and 50 mM 
sodium phosphate pH 11.5. Filters were incubated on 
plates for 3-6 h at 37 "C to produce halos of about 5 
mm for wild-type subtil is in and were discarded. The 
plates were stained for 10 min at 24 'C with Coomassie 
blue solution (0.25% Coomassie blue (R-250) 25% 
ethanol) and destained with 25% ethanol, 10% acetic 
acid for 20 min- Zones of proteolysis appeared as 
blue halos on a white background on the underside of 
the plate and were compared to the original growth 
plate that was similarly stained and destained as a 
control. Clones were considered positive that 
produced proportionately larger zones of proteolysis 
on the high pH plates relative to the original growth 
plate. Negative clones gave smaller halos under 
alkaline conditions. Positive and negative clones 
were restreaked to colony purify and screened again in 
triplicate to confirm alkaline pH results. 
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D. Identification and Analysis 0251446 
of Mutant Subtilisins 

Plasmid DNA from 5 ml overnight cultures of more 
alkaline active B.subtilis clones was prepared 
according to Birnboim and Doly (Birnboim, H.C., et al. 

(1979) Nucleic Acid Res. 7, 1513) except that 
incubation with 2 mg/ml lysozyme proceeded for 5 min 
at 37°C to ensure cell lysis and an additional 
phenol/CHCl 3 extraction was employed to remove 
contaminants. The 1*5 kb EcoRI-BamHI fragment 
containing the subtilisin gene was ligated into 
M13mpll and template DNA was prepared for DNA 
sequencing (Messing, J., et al. (1982) Gene, 19^ 
269-276). Three DNA sequencing primers ending at codon 
26, +95, and +155 were synthesized to match the 
subtilisin coding sequence. For preliminary sequence 
identification a single track of DNA sequence, 
corresponding to the dNTPaS misincorporation library . 
from which the mutant came, was applied over the 
entire mature protein coding sequence {i.e., a single 
dideoxyguanosine sequence track was applied to 

20 identify a mutant from the dGTPas library) . A 
complete four track of DNA sequence was performed 200 
bp over the site of mutagenesis to confirm and 
identify the mutant sequence (Sanger, F. , et al . , 

(1980) J. Mol, Biol. , - 143, 161-178). Confirmed 
25 positive and negative bacilli clones were cultured in 

LB media containing 12.5ug/mL cmp and purified from 
culture supernatants as previously described (Estell, 
D.A., et al. (1985) J. Biol. Chem. , 260 , 6518-6521). 
Enzymes were greater than 9 8% pure as analy2ed by 
30 SDS-polyacrylamide gel electrophoresis (Laemmli, U.K. 
(1970) , Nature, 227, 680-685) , and protein 
concentrations were calculated from the absorbance at 
280 nm, e^gj* = 1-17 (Maturbara, H. , et al. (1965), J. 
Biol. Chem , 240, 1125-1130) • 
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Enzyme activity was measured with 2 00,og/mL 
BUCcinyl-L-AlaL-AlaL-ProL-Fhep~nitroanilide (Sigma) in 
0.1M Tris pH 8.6 or 0.1 M CAPS pH 10,8 at 25'C. 
Specific activity (m moles product/min-mg) vas 
calculated from the change in absorbance at 410 nm 

5 from production of p-nitroaniline with time per mg of 
enzyme (E410 ~ 8,480 M-lcm-1; Del Mar, E.G., et al. 
(1979), Anal. Biochem. , 316-320). Alkaline 

autolytic stability studies were performed on purified 
enzymes (200pg/mL) in 0.1 M potassium phosphate (pH 

10 12.0) at 37'C. At various times aliquots were assayed 
for residual enzyme activity (Wells, J. A., et al. 
(1986) J. Biol. Chem. . 261, 6564-6570) . 

1S E. Results 

1, Optimization and analysis 
of mutagenesis frequency 

A set of primer-template molecules that were randomly 
3 ' -terminated over the subtilisin gene (Fig. 31) was 

20 produced by variable extension from a fixed 5 f -primer 
(The primer mutated a unique Aval site at codon 11 in 
the subtilisin gene) . This was achieved by stopping 
polymerase reactions with EDTA after various times of 
extension. The extent and distribution of duplex 

25 formation over the 1 kb subtilisin gene fragment was 
assessed by multiple restriction digestion (not 
shown) . For example, production of new Hinfl 
fragments identified when polymerase extension had 
proceeded past IlellO, Leu233, and Asp259 in the 
30 subtilisin gene. 

Mis incorporation of each dNTPas at randomly terminated 
3' ends by AMV reverse transcriptase (Zakour, R.A. , et 
al* (1982), Nature , 295 , 708-710; Zakour, R.A., et al. 
35 (1984), Nucleic Acids Res » . 12, 6615-6628), used 
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conditions previously described (Champoux, J.J., 
(1984), Genetics, 454-464). The efficiency of each 
misincorporation reaction was estimated to be greater 
than 8 0% by the addition of each dKTPas to the Aval 
restriction primer, and analysis by polyacrylamide gel 
5 electrophoresis, Misincorporations were sealed by 
polymerization with all four dNTP's and closed 
circular DNA was produced by reaction with DNA ligase. 

Several manipulations were employed to maximize the 
yield of the mutant sequences in the heteroduplex. 
These included the use of a deoxyuridihe containing 
template (Kunkel, T. A* (1985), Proc. Natl. Acad. Sci, 
USA, 82 4BB-492; Pukkila, P.J. et al. (1983), 
Genetics , 104, " 571-5B2) , in vitro methylation of the 
mutagenic strand (Kramer, W. et al. (1982) Nucleic 
Acids Res, r 10. 6475-6485) , and the use of Ava l 
restriction-selection against the wild-type template 
strand which contained a unique Ava l site. The 
separate contribution of each of these enrichment 
procedures to the final mutagenesis frequency was not 
determined, except that prior to Ava l restriction- 
selection roughly one-third of the segregated clones 
in each of the four pools still retained a wild-type 
Aval site within the subtilisin gene. After Ava l 
restriction-selection greater than 98% of the plasmids 
lacked the wild-type Ava l site. 

The 1.5 kb EcoRI - BamHl subtilisin gene fragment that 
was resistant to &val restriction digestion, from each 
3 0 of the four CsCl purified M13 RF pools was isolated on 
low melting agarose. The fragment was ligated in situ 
from the agarose with a similarly cut £. coli -B. 
subtil is shuttle vector, pBOlBO, and transformed 
directly into £ coli LE392. Such direct ligation and 
35 transformation of DNA isolated from agarose avoided 
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loses and allowed large numbers of recombinants to 
obtained (>100 / 000 per ug" equivalent of input M13 
pool) * 

The freguency of mutagenesis for each of the four 
dNTPas misincorporation reactions was estimated from 

5 the freguency that unigue restriction sites were 
eliminated (Table XX) . The unique restriction sites 
chosen for this analysis, Cla l, PvuII, and Kpn l, were 
distributed over the subtilisin gene starting at 
codons 35, 104, and 166 , respectively. As a control, 

10 the mutagenesis frequency was determined at the PstI 
site located in the B lactamase gene which was outside 
the window of mutagenesis. Because the absolute 
mutagenesis frequency was close to the. percentage of 
undigested plasmid DNA, two rounds of restriction- 

15 selection were necessary to reduce the background of 
surviving uncut wild-type plasmid DNA below the mutant 
plasmid (Table XX) • The background of surviving 
plasmid from wild-type DNA probably represents the sum 
total of spontaneous mutations, uncut wild-type 

20 plasmid, plus the efficiency with which linear DNA can 
transform E . coli. Subtracting the frequency for 
unmutagenized DNA -(background) from the frequency for 
mutant DNA, and normalizing for the window of 
mutagenesis sampled by a given restriction analysis 

25 (4-6 b P> provides an estimate of the mutagenesis 
efficiency over the entire coding sequence (-1000 bp) . 
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a-thiol 
dNTP 

mis incor- 
porated/^ 



Restriction 

Site 1st 
Selection round 



% resistant clones c % resistant 

clones over 



2nd 

round Total Background^ lOOQbp* 



% 

mutants 
•per 



5 


None 


PstI 


0.32 


0.7 


0.002 


0 






G 


PstI 


0.33 


1.0 


0.003 


0.001 


0.2 




T 


PstI 


0.32 


<0.5 


<0.002 


0 


0 




C 


PstI 


0.43 


3.0 


0.013 


0.011 


3 


10 


None 


Clal 


0.28 


5 


0.014 . 


. 0 






G 


Clal 


2.26 


85 


1.92 


1.91 


380 




T 


Clal 


0.48 


31 


0.15 " 


0.14 


35 




C 


Clal 


0.55 


15 


0.08 


0.066 


17 


15 


None 


PvuII - 


0.08 


29 


0.023 


0 






G 


PVUII 


0.41 


90 


0.37 ' 


0.35 


88 




T 


PvuII 


0.10 


67 


0.067 


0.044 


9 




C 


PvuII 


0.76 


53 


0.40 


0.38 


95 



20 



None 


Kpnl 


0.41 


3 


0.012 


0 




G 


Kpnl 


0.98 


35 


0.34 


0.33 


83 


T 


Kpnl 


0.36 


15 


0.054 


0.042 


8 


c . 


Kpnl 


- 1.47 


26 


0.38 


0.37 


93 
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(a) 

Mutagenesis frequency is estimated from the 
frequency for obtaining mutations that alter unique 
restriction sites within the mutagenized subtilisin 
gene (i.e., Cla l , PvuII, or Kpn l) compared to mutation 
frequencies of the Pst I site, that is outside the 
window of mutagenesis. 

<b) 

Plasmid DNA was from wild-type (none) or 
mutagenized by dNTPas misincorporation as described. 

(c) 

Percentage of resistant clones was calculated 
from the fraction of clones obtained after three fold 
or greater over-digestion of the plasmid with the 
indicated restriction enzyme compared to a 
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non-digested control . Restriction-resistant plasmid 
DNA from .the first round was subjected to a second 
round of restriction-selection. The total represents 
the product of the fractions of resistant clones 
obtained from both rounds of selection and ^ gives 
percentage of restriction-site mutant clones in the 
original starting pool. Frequencies were derived from 
counting at least 20 colonies and usually greater than 
5 100. 

^ Percent resistant clones was calculated by 
subtracting the percentage of restriction-resistant 
clones obtained for wild-type DNA (i.e., none} from 
that obtained for mutant DNA, 

xo * e ' This extrapolates from the frequency of mutation 
over each restriction site to the entire subtilisin 
gene (-1 kb) . This has been normalized to the number 
of possible bases (4-6 bp) within each restriction 
site that can be mutagenized by a given 
misincorporation event* 
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From this analysis, the average percentage of 
subtilisin genes containing mutations that result from 
dGTPas, dCTPas, or dTTPas misincorporation was 
estimated to be 90/ 70 f and 20 percent , respectively. 
These high mutagenesis frequencies were generally 
quite variable depending upon the dNTPas and 
misincorporation efficiencies at this site. 
Misincorporation efficiency has been reported to be 
both dependent on the kind of mismatch, and the 
context of primer (Champoux, J. J., (1984)? Skinner, 
J. A., et al. (1986) Nucleic Acids Res. , 14 , 
6945-6964) . Biased misincorporation efficiency of 
dGTPas and dCTPas over dTTPas has been previously 
observed (Shortle, D. f et al. (1985), Genetics ; 110 , 
539-555). Unlike the dGTPas, dCTPas, and dTTPas 
libraries the efficiency of mutagenesis for the dATPas 
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misincorporation library could not be accurately 
assessed because 90% of the restriction-resistant 
plasmids analyzed simply lacked the' subtilisin gene 
insert. This problem probably arose from 

self-ligation of the vector when the dATPas 
mutagenized subtilisin gene was subcloned from M13 
into pB0180. Correcting for the vector background, we 
estimate the mutagenesis frequency around 2 0 percent 
in the dATPas misincorporation library. In a separate 
experiment {not shown) , the mutagenesis efficiencies 
for dGTPas and dTTPas misincorporation were estimated 
to be around 50 and 30 percent, respectively, based on 
the frequency of reversion of an inactivating mutation 
at codon 169. 

The location and identity of each mutation was 
determined by a single track of DNA sequencing 
corresponding to the mis incorporated athiodeoxy- 
nucleotide over the entire gene followed by a complete 
four track of DNA sequencing focused over the site of 
mutation. Of 14 mutants identified, the distribution 
was similar to that reported by Shortle and Lin (19 85) 
except we did not observe nucleotide insertion or 
deletion mutations. The proportion of AG mutations 
was highest in the G misincorporation library, and 
some unexpected point mutations appeared in the dTTPas 
and dCTPas libraries. 



2. Screening and Identification of 

Alkaline Stability Mutants of Subtilisin 

It is possible to screen colonies producing subtilisin 
by halos of casein digestion {Wells, J, A. et al. 
(1983) Nucleic Acids Res. , 11 , 7911-7925). However, 
two problems were posed by screening colonies under 
high alkaline conditions (>pH 11). First, B. subtilis 
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will not grow at high pH # and we have been unable to 
transform an alkylophilic strain of bacillus* This 
problem was overcome by adopting a replica plating 
strategy in which colonies were grown on filters at 
neutral pH to produce subtil isin and filters 

5 subsequently transferred to casein plates at pH 11.5 
to assay subtilisin activity. However, at pH 11,5 the 
casein micells no longer formed a turbid background 
and thus prevented a clear observation of proteolysis 
halos. The problem was overcome by briefly staining 

10 the plate with Coomassie blue to amplify proteolysis 
zones and acidifying the plates to develop casein 
micell turbidity. By comparison of the halo size 
produced on the reference growth plate (pH 7) to the 
high pH plate (pH 11.5), it was possible to identify 
!5 mutant subtilisins that had increased (positives) or 
decreased (negatives) stability under alkaline 
conditions. 



Roughly 1000 colonies were screened from each of the 
20 four misincorporation libraries. The percentage of 
colonies showing a differential loss of activity at pH 
11.5 versus pH 7 represented 1.4, 1.8, 1.4, and 0.6% 
of the total colonies screened from the thiol dGTPas, 
dATPttS, dTTPas, and dCTPas libraries, respectively. 
25 Several of these negative clones were sequenced and 
all were found to contain a single base change as 
expected from the misincorporation library from which 
they came. Negative mutants included A3 6, £170 and 
V50. Two positive mutants were identified as V107 and 
30 R213. The ratio of negatives to positives was roughly 
50:1. 
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3. Stability and Activity of 

Subtil i sin Muta nts at Alkaline pfl 

Subtilisin mutants were purified and their autolytic 
stabilities were measured by the time course of 
inactivation at pH 12.0 {Pigs. 32 and 33). Positive 
mutants identified from the screen (i.e., V107 and 
R213) were more resistant to alkaline induced 
autolytic inactivation compared to wild-type; negative 
mutants (i.e., E170 and V50) were less resistant. We 
had advantageously produced another mutant at position 
50 (F50) by site-directed mutagenesis. This mutant 
was more stable than wild-type enzyme to alkaline 
autolytic inactivation (Fig. 33) At the termination 
of the autolysis study, SDS-PAGE analysis confirmed 
that each subtilisin variant had autolyzed to an 
extent consistent with the remaining enzyme activity. 

The stabilizing effects of V107, R213, and F50 are 
cumulative. See Table XXI. The double mutant, 
V107/R213 (made by subcloning the 920 bp Eco Ri-Kpnl 
fragment of pB0180V107 into the 6.6 kb EcoRI-Kpnl 
fragment of pB0l80R213), is more stable than either 
single mutant. - The triple mutant, F50/V107/R213 (made 
by subcloning the 735 bp EcoRl-Pvuii- fragment of pFSO 
(Example 2) into the 6.8 kb EcoRI-PvuH fragment of 
PB0180/V107, is more stable than the double mutant 
V107/R213 or F50. The inactivation curves show a 
biphasic character that becomes more pronounced the 
more stable the mutant analyzed. This may result from 
some destablizing chemical modification (s) (eg., 
deamidation) during "the autolysis study and/or reduced 
stabilization caused by complete digestion of larger 
autolysis peptides. These alkaline autolysis studies 
have been repeated on separately purified enzyme 
batches with essentially the same results. Rates of 
autolysis should depend both on the conformational 
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F. Random Cassette Mutagenesis 
of Residues 197 through 22 8 

plasmid pA222 (Wells, et al . (1985) Gene 34 , 315-323) 
was digested with Pst I and Bam Hl and the 0.4 kb 
Pst l/ Bam HI fragment (fragment 1, see Fig. 34) purified 
from a polyacrylamide gel by electroelution. 
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The 1.5 kb jEcoRI/BamHI fragment from pS4.5 was cloned 
into Ml3mp9. Site directed mutagenesis was performed 
to create the A197 mutant and simultaneously insert a 
silent SstI site over codons 195-196, The mutant 
EcoR I/ BamH I fragment was cloned back into pBS42. The 
pA197 plasmid was digested with BamH I and Sst I and the 
5.3 kb BamH I/ Sst I fragment (fragment 2) was purified 
from low melting agarose. 

Complimentary oligonucleotides were synthesized to 
span the region from Sst I (codons 195-196) to Pst I 
(codons 228-23 0) . These oligodeoxynucleotides were 
designed to (1) restore codon 197 to the wild type, 
(2) re-create a silent Kon l site present in pa 22 2 at 
codons 219-22 0, (3) create a silent Sma l site over 
codons 210-211, and (4) eliminate the Pst I site over 
codons 228-230 (see Fig. 35) . Oligodeoxynucleotides 
were synthesized with 2% contaminating nucleotides at 
each cycle of synthesis, e.g., dATP reagent was spiked 
with 2% dCTP, 2% dGTP, and 2% dTTP. For 97-mers, this 
2% poisoning should give the following percentages of 
non-mutant, single mutants and double or higher 
mutants per strand with two or more mis incorporations 
per complimentary strand: 14% non-mutant, 28% single 
mutant, and 57% with ^2 mutations, according to the 
general formula 

n 

35 f *■ ■ — e-M • 

nl 
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where p is the average number of mutations and n is a 
number class of mutations and f is the fraction of the 
total having that number of mutations. Complimentary 
oligodeoxynucleotide pools were phosphorylated and 
5 annealed {fragment 3) and then ligated at 2 -fold molar 
excess over fragments 1 and 2 in a three-way ligation. 



E. coli MM294 was transformed with the ligation 
reaction / the transformation pool - grown up over night 
and the pooled plasmid DNA was isolated* This pool 
represented 3.4 x 10 4 independent transformants. This 
plasmid pool was digested with Pst I and then used to 
"transform E . coli . A second plasmid pool was 
prepared and used to transform B. - subtil is (BG2036) . 
Approximately 40% of the BG2036 transformants actively 
expressed subtil isin as judged by halo-clearing on 
casein plates. Several of the non-expressing 
transformants were sequenced and found to have 
insertions or deletions in the synthetic cassettes. 
Expressing BG203 6 mutants were arrayed in microtiter 
dishes with 150^1 of LB/12. 5/ig/mL chloramphenicol 
(cmp) per well, incubated at 37 # C for 3-4 hours and 
then stamped in duplicate onto nitrocellulose filters 
laid on LB 1.5% skim milk/5/jg/mL cmp plates and 
incubated overnight at 3 3*C (until halos were 
approximately 4-8 mm in diameter) . Filters were then 
lifted to stacks of filter paper saturated with 
1 x Tide commercial grade detergent, 50 aM Na 2 C0 3 , pH 
11.5 and incubated at 65 *C for 90 min. Overnight 
growth plates were Commassie stained and destained to 
establish basal levels of expression. After this 
treatment, filters were returned to pH7/skim 
ailk/20^g/mL tetracycline plates and incubated at 37 *C 
for 4 hours to overnight. 
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Mutants identified by the high pH stability screen to 
be more alkaline stable were purified and analyzed for 
autolytic stability at high pH or high temperature. 
The double mutant C2 04/R213 was more stable than wild 
type at either high pH or high temperature (Table 
XXII). 

This mutant was dissected into single mutant parents 
(C204 and R213) by cutting at the unique Smal 
restriction site (Fig. 35) and either ligating wild 
type sequence 3 1 to the Smal site to create the single 
C204 mutant or ligating wild type sequence 5' to the 
Smal site to create the single R213 mutant. Of the 
two single parents, C204 was nearly as alkaline stable 
as the parent double mutant (C04/R213) and slightly 
more thermally stable. See Table XXII. The R213 
mutant' was only slightly more stable than wild type 
under both conditions (not shown) • 

Another mutant identified from the screen of the 197 
to 22B random cassette mutagenesis was R204. This 
mutant was more stable than wild type at both high pH 
and high temperature but less stable than C204. 
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Stability of subtilisin variants 

5 Purified enzymes {2 00^g/mL) were incubated in 0.1M 
phosphate, pH 12 at 30 "C for alkaline autolysis, or in 
2mM CaCl 2 , 50mM MOPS , pH 7.0 at 62 *C for thermal 
autolysis. At various times samples were assayed for 
residual enzyme activity. Inactivations were roughly 

10 pseudo-first order, and t 1/2 gives the time it took 
to reach 50% of the starting activity in two separate 
experiments . 
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t 1/2 t 1/2 

(alkaline (thermal 



25 



Subtilisin variant 


autolysis) 
Exp . Exp , 

#r #2 


autolysis) 
Exp. Exp. 
#1 #2 


wild type 


30 


25 


20 


23 


F50/V107/R213 


49 


41 


18 


23 


R204 


35 


32 


24 


27 


C204 


43 


46 


38 


40 


C204/R213 


50 


52 


32 


36 


L2 04/R213 


32 


30 


20 


21 



G. Random Mutagenesis at Codon 204 

3 0 Based on the above results, codon 204 was targeted for 
random mutagenesis. Mutagenic DNA cassettes (for 
codon at 204) all contained a fixed R213 mutation 
which was found to slightly augment the stability of 
the C204 mutant. 
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Plasmid DKA encoding the subtilisin mutant C2 04/R213 
was digested with SstI and EcoRI and a 1.0 kb 
EcoRI/ Sst I fragment was isolated by electro-elution 
from polyacrylamide gel (fragment 1, see Fig. 3 5) . 

5 

C204/R213 was also digested with Sma l and EcoRI and 
the large 4.7 kb fragment, including vector sequences 
and the 3 1 portion of coding region, was isolated from 
low melting agarose (fragment 2, see Fig. 36) . 

10 

Fragments 1 and 2 were combined in four separate 
three-way ligations with heterophosphorylated 
fragments 3 (see Figs. 36 and 37). This hetero- 
phosphorylation of synthetic duplexes should 
15 preferentially drive the phosphorylated strand into 
the plasmid ligation product. Four plasmid pools, 
corresponding to the four ligations, were restricted 
With Sma l in order to linearize any single cut 
C2 04/R213 present from fragment 2 isolation, thus 
20 reducing the background of C204/R213. coli was 

then re-transformed with Smal-restricted plasmid pools 
to yield a second set of plasmid pools which are 
essentially free of C2 04/R213 and any non-segregated 
heterduplex material. 

25 

These second enriched plasmid pools were then used to 
transform subtilis (BG203 6) and the resulting four 
mutant pools were screened for clones expressing 
subtilisin resistant to high pK/temperature 
30 inactivation. Mutants found positive by such a screen 
were further characterized and identified by 
sequencing . 

The mutant L2 04/R213 was found to be slightly more 
35 stable than the wild type subtilisin. See Table XXII. 
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Having described the preferred embodiments of the 
present invention, it will appear to those ordinarily 
skilled in the art that various modifications may be 
made to the disclosed embodiments, and that such 
modifications are intended to be within the scope of 
the present invention. 
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CLAIMS; 

1. A carbonyl hydrolase mutant having at least one 
property which is substantially , different from the 
same property of a precursor carbonyl hydrolase from 
which the amino acid sequence of said carbonyl 
hydrolase mutant is derived, said property being 
selected from the group consisting of thermal 
stability and alkaline stability wherein said 
precursor carbonyl hydrolase is selected from the 
group consisting of naturally occurring carbonyl 
hydrolases and recombinant carbonyl hydrolases and 
said carbonyl hydrolase mutant amino acid sequence is 
derived by a method selected from the group consisting 
of the substitution, deletion and insertion of at 
least one amino acid in said amino acid sequence of 
said precursor carbonyl hydrolase. 

2, A carbonyl hydrolase mutant having at least one 
property which is substantially different from the 
same property of a precursor carbonyl hydrolase from 
which the amino acid sequence of said carbonyl 
hydrolase mutant is derived, said property being 
selected from the group consisting of oxidative 
stability, substrate specificity, catalytic activity, 
thermal stability, alkaline stability and pH activity 
Profile wherein said precursor carbonyl hydrolase is 
selected from the group consisting of naturally 
occurring carbonyl hydrolases and recombinant carbonyl 
hydrolases and said carbonyl hydrolase mutant amino 
acid sequence is derived by a method selected from the 
group consisting of deletion and insertion of at least 
one amino acid in said amino acid seguence of said 
precursor carbonyl hydrolase and substitution of more 
than one amino acid residue of said amino acid 
sequence of said precursor carbonyl hydrolase* 



-131- 



0251 446 



3. A carbonyl hydrolase mutant derived by the 
replacement of at least one amino acid residue of a 
precursor carbonyl hydrolase with a different amino 
acid, said one amino acid residue being selected from 

5 the group of amino acid residues of Bacillus 
amyloliquef aciens subtilisin consisting of Tyr21, 
Thr22, Ser24, Ser33, Asp36, Ala45, Gly46 / Aia48, 
Ser49, Met50, Asn77, Ser87, Lys94, Val95, Leu96, 
Tyrl04, Ilel07 r GlyllO, Metl24 # Asnl55, Glul56 # 

10 Lysl70, Tyrl71, Prol72, Phel89, Aspl97, Metl99, 
Ser204, Lys213 / Tyr217, Ser221 # His67, Leul26, Leul35, 
Gly97, Asp99, SerlOl, Glyl02, Glul03, Leul26, Glyl27, 
Glyl28, Prol29 r Tyr214, Gly215 r and equivalent amino 
acid residues in other precursor carbonyl hydrolases. 

15 

4. A carbonyl hydrolase mutant having an amino acid 
sequence derived from the amino acid sequence of a 
precursor carbonyl hydrolase by the substituion of a 
differnt amino acid for more than one amino acid 

20 residue' of said amino acid sequence of said precursor 
carbonyl hydrolase, said amino acid ._ residues being 
selected from the group of amino acid residues of 
Bacillus amyloliquef aciens subtilisin consisting of 
Tyr21, Thr22, Ser24 f Asp32, Ser33, Asp36, Ala45, 

25 Gly46 # Ala48, Ser49, Met5G, Asn77, SerB7, Lys94, 
Val95, Leu96, Tyrl04, Ilel07, GlyllO, Metl24, Alal52 / 
Asn-155, Glul56, Glyl66, Glyl69, Lysl70, Tyrl71, 
Frol72, Phel89, Aspl97 r Metl99 f Ser204, Lys213, 
Tyr217, Ser221, Met222, His67, Leul26, Leul35, Gly97, 

30 Asp99, SerlOl, Glyl02, Glul03, Leul26, Glyl27, Glyl28, 
Prol29, Tyr214, Gly215, and equivalent amino acid 
residues in other precursor carbonyl hydrolases. 
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5 m The mutant of Claim 4 wherein said combinations 
are selected from the group consisting of Thr22/Ser87, 
Ser24/Ser87, Ala45/Ala4B / Ser49/Lys94, Ser49/Val95, 
Met50/Val95, MetSO/GlyllO , Met50/Metl24 , Met50/Met222, 
Met 1 2 4 /Met 2 22 , Glul 5 6 /Glyl 6 6 , Glu 156/ Gly 1 6 9 , 

Glyl66/Met222, Glyl69/Met222 , Tyr21/Thr22 , 

Met50/Metl24/Met222 , Tyr21/Thr22/Ser87, Met50/Glul5 6/ 
,Glyl66/Tyr217, Met50/Glul56/Tyr217 , Glul56/Glyl69/ 
Tyr217, Ilel70/Lys213 , Ser204/Lys213 , Met50/Ilel07/ 
Lys213 and Ser24/Met50/Ilel07/Glul56/Glyl66/Glyl69/ 
Ser204/Lys213/Gly215/Tyr217. 

6. A carbonyl hydrolase mutant derived by the 
replacement of at least one amino acid residue of a 
precursor carbonyl hydrolase with a different amino 
acid, said one amino acid residue being selected from 
the group of amino acid residues of of Bacillus 
amyloligue faciens subtilisin consisting of Tyr21, 
Thr22, Ser24, Asp32, Ser33, Asp36, Ala45, Gly46, 
Ala48, Ser49 f Met50 r Asn77, Ser87, Lys94, Val95 f 
Leu96, Tyrl04, Ilel07, GlyllO, Metl24 , Alal52, 
Asn-155, Glul56, m Glyl66, Glyl69, Lysl70, Tyrl71, 
Prol72, Phel89, Aspl97, Metl99 f Ser204, Lys213, 
Tyr217, Ser221, Met222 f His67, Leul26, Leul35 f Gly97, 
Asp99, SerlQl, Glyl02, Glul03, Leul26, Glyl27, Glyl28, 
Prol29, Tyr214, Gly215, and equivalent amino acid 
residues in other precursor carbonyl hydrolases, 
wherein said at least one amino acid residue of said 
precursor carbonyl hydrolase is replaced with the 
amino acid residues listed in TABLE I and TABLE II 
herein . 

7, The mutant of Claim 6 wherein the amino acid 
replacing said at least one amino acid residue in said 
precursor carbonyl hydrolase is selectd from the 
replacement amino acids listed in TABLE I herein. 
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8. Mutant DNA sequence encoding the mutant of claims 
1 through 7 . 



9- Expression vector containing the mutant DNA 
sequence of claim 8. 

5 

10, Host cell .transformed with the expression vector 
of Claim 9. 

10 
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P RBS ^- PRE V PRO MAT TERM 



? p ? ? 



RBS 



-in? 

1 ^JJ^ACTAAAATATTATTCCA T^CTAT ACAATTAATACACASAATMTCTSTCT^I TBGTTATTCTGCAAATGAAAAAAAG GAEAGG ATAAAftA GTG 

-inn PRE -on 

Arq Gly Lys Lys Yal Trp lie Ser Leu Leu Phe Ala Leu Ala Leu He Phe Thr Met Ala Phe Gly Ser Thr Srr 

99 AAA GGC AAA AAA GTA TGG ATC AG7 TTG CTG TTT GCT TTA GCG TTA ATC TTT ACG ATE GCG TTC GGC ARC AC* TCC 

-BD -70 PRO -60 

Ser Ala Gin Ala Ala Gly Lvs Ser Asn Glv Glu Lvs Lys Tyr I le Val Gly Phe lys Gin Thr Met Ser Thr -Met 
174 TCT GCC CAG GCG GCA GGG AAA TCA AAC GGG GAA AAG AAA TAT ATT GTC GGG TTT AAA CAG ACA ATG AGC ACG ATG 

-50 -40 
Ser Ala Ala Lys Lys Lys Asp Val Tie Ser Glu Lys G1y Gly Lys Val Gin Lvs Gin Phe Lrs Tyr Val Asp Ala 
249 AGC GCC GCT AAG AAG AAA GAT GTC ATT TCT GAA AAA fifiC fiftfi AAA GTG CAA AAG CAA TTC AAA TAT GTA GAC GCA 

-30 -20 -10 

Ala Ser Ala Thr. Leu Asn Glu Lys Ala Val Lys Glu Leu Lvs Lys Asp Pro Ser Val Ala Tyr Val Glu Glu *so 
324 GCT TCA GCT ACA TTA AAC GAA AAA GCT GTA AAA GAA TTG AAA AAA GAC CCG AGC GTC GCT TAC GTT GA* GAA GAT 

-irr* MAT 

His Val Ala His Ala Tyr Mi Gin Ser Val Pro Tyr Gly Val Ser Gin Me Lys Ala Pro Ala Leu His Ser Gin 
399 CAC GTA GCA CAT GCG TAC GCG CAG TCC GTG CCT TAC GGC GTA TCA CAA ATT AAA GCC CCT GCT CTG CAC TCT CAA 
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Asp Ala 100 110 
Val Leu Gly Ala Asp Gly Ser Gly-Gln Tyr Ser Trp lie Tie Asn Gly He Glu Trp Ala He Ala Asn Asn **t 
699 GTT CTC GGT GCT GAC GGT TCC GGC CAA TAC AGC TGG ATC ATT AAC GGA ATC GAG TGC GCG ATC- GCA AAC AftT ATG 

120 130 140 

Asp Val lie Asn Met Ser Leu Gly Glv Pro Ser Glv Ser Ala Ala Leir Lvs Ala Ala Val Asn lvs Ala v*l 41a 
774 GAC GTT ATT AAC ATG AGC CTC GGC GGA CCT TCT GGT TCT GCT GCT TTA AAA GCG GCA GTT GAT AAA GCC GTT GCA 

150 Ser Thr 160 

Ser Gly Val Val Val Val AT a Ala Ala Gly Asn Glu Gly Thr Ser Gly Srr Srr Ser Thr Val Gly Tvr Pro Glv 
B49 TCC GGC GTC GTA GTC GTT GCG GCA GCC GGT AAC GAA GGC ACT TCC GGC AGC TCA AGC ACA GTG GGC TAC fCT GGT 

170 1 80 1 90 

Lvs Tyr Pro Ser Val lie Ala Val filv Ala Val Asn Ser Ser Asn Gin Am Ala Ser Phe Ser Ser Val Gly Prn 
924 AAA TAC CCT TCT GTC ATT GCA GTA GGC GCT GTT GAC AGC AGC AAC CAA AG A GCA TCT TTC TCA ARC GTA GGA CCT 

?00 tlO 
Glu Leu Asp Val Met Ala Pro Gly Val Ser Tie Gin Ser Thr Leu Prn Gly Asn Lvs Tyr Gly Ala Tvr Aj n Glv 
999 GAG CTT GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA AGC ACG CTT CCT GGA AAC AAA TAC GGG GCG TAC AAC GGT 

??0 230 ?40 

Thr Ser Met Ala Ser Pro His Val Ala Gly Ala Ala Ala Le« Tie Leu Ser Lvs His Pro Asn Trp Thr Asn Thr 
1074 ACG TCA ATG GCA TCT CCG CAC GTT GCC GGA GCG GCT GCT TTG ATT CTT TCT AAG CAC CCG AAC TGG ACA AAC ACT 

250 Gin 260 
Gin Val Aro Ser Ser Leu Glu Asn Thr Thr Thr Lvs Leu Gly Asp Ser Phe Tyr Tyr Glv Lvs Glv Leu lie Asn 
11«9 CAA GTC CGC AGC AG! TTA GAA AAC ACC ACT ACA AAA CTT GGl GAT TCT TTC TAC TAT GGA AAA GGG CTG ATC AAC 

Val Gin Ala Ala Ala G^n OC TERM 
1224 GTA CAG GCG GCA GCT CAG TAA AAC ATAAAAAACCGGC CTTGGCCCC GCrGGTlTTTlATT A lTTTT CTTCCTCCGCATGTTCAATCCGCTCC 

1316 ATAATCGACGGATGGCTCCCTCTGAAAATTTTAACGAGAAACGRCGGGTTGACCCGGCTCAGTCCCGTAACGGCCAAGTCCTGAAACGTCTCAATCGCCG 

1*16 CnCCCGGTTTCCGGTCAGCTCAATGCCGTAACGGTCGGCGGCGTTTTCCTGATACCGGGAGACGGCATTCGTAAKGGATC P I 
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HoholoQy of fiielUui protease* 

1. Bacillus •nylollqulfflclflna 

2. Baeillus lufatllli vir.ItSB 
3.6acillu» lichtnifornit (earlibergeni It ) 
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TOTALLY CONSERVED RESIDUES IN SUBTJLISIN6 
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195 2C0 206 

W.TA.A-: Glu Leu Asp Val Met Ala Pro Gly Val Ser lie Gin 

WT. DNA: GAG CTT GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA 
CTC GAA CIA CAG TAC CGT GGA CCG CAT AGA TAG GTT 

pA222DNA: GAG CTT GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA 
CTC GAA CTA CAG TAC CGT GGA CCG CAT AGA TAG GTT 

A197DNA: GAG CTC GCA GTC ATG GCA CCT GGC GTA TCT ATC CAA 
CTC GAG CGT CAG TAC CGT GGA CCG CAT AGA TAG GTT 
Sstl 

Fr*gJB«tJ from GAG-CT 
pA222w>dA197 Cp 
cut w/ Fstl, Ssih 



pA77?, A 197 GAG CTC GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA 

cut&Iigatoi CTC GftS,.CJft.Cftg TAC CGT GGA CCG CAT ftSft TAG GTT 
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oucleotide pools: 

207 21a 218 
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TCG TGC GAA GGA CCT TTG TTT ATG CCC CGC ATG TTG 
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f TCG TGC GAA GGA CCT TTG TTT ATG CCC CGC ATG TTG 
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PA222DNA: SET ACC TCA CG CAC GCT. CCA „gGA GCG-3' 
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ni 



FIG.- 



0251 446 



Sstl^^smal 



213 



EcoRI 



pBR322 

ori k'z 



BamHI 




1. Ssil/EcoRI digestion 



2. Purify 1.0 kb EcoRI/Sst! 
fragment 



Fragment 1 



CAT 

1. Smat/EooRI digestion 

2. Purify 4.7 kb EcoRI/Smal 
fragment 





Fragment 2 



p 01 

#2 



#1 

5asZ 5 3 



#2 
#3 

#3 



Four 3-way ligations 



#4 



Fragments 3 

Heterophosphorylaled 
duplexes 

(see Fig. 4) 




1. Transform 



2. Digest 4 plasmid pools with Smal, 
retransfomi E. coli 

3. Transform B. subtilis (BG2036) 
with 4 second pools 

4. Screen 



FIG.— 36 



0251 446 



C 



13 
o p 

CD 

In 

JCJ 
Eh 

M 

OJ 
CQ 

P 
•H 

CD 



o <D 



td 
> 

•H 
O 

o 

Ph 



4J 



> 

a 



ON 1 "H 




U CD 



cd u 
a cd 

<j o 
u a 

H < 
+ CD O 
H < 



cd 



Eh 



U CD 

CD CJ 

cd o 

£h < 

o a 

u CD 

< Eh 
CJ CD 
CD CJ 

g£ 

< E-i 

CJ CD 

H < 

CD U 

Eh < 

< H 
CD CJ 

*CJ| cd 

*H 
u 

CD 
< 

i 



+ id u 

y cs 

CD O 

CD CJ 

CD u 



to m 
1 i 

< H 

* CD U 

< Eh 

CJ CD 
5 |h 

* a cj 

CD O J 



B 

CJ cDj co 



Eh <| 

U CD 



CD 
U 



CDJ 

CJ CD 

CJ CD 
Eh < 



< Eh 







EH 


Eh 


CD 




CJ 














CO 


a 












(J 




CD U 


cS 


i 




1 t 










m 



cd a 

CD CD 
CD CJ 
CD U 

H < 
O CD 
CJ CD 

< Eh 

a cd 

CD CJ 
CD (J 

< H 

CJ CD 
Eh < 
CJ CJ 

Eh <l 

< H 
CD a 

CJ( CD 
< 
CD 

a 

Eh 
CJ 



o 




to 

! 



o 



CO 



